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(57) Abstract 

A novel mouse polypeptide applicable to the treatment of diseases in which abnormal proliferation of smooth muscle participates, for 
example, arteriosclerosis and myoma, because of having an effect of inhibiting the proliferation of vascular smooth muscle cells. Moreover, 
this polypeptide has hematopoietic cell-regulatory activity, tissue generation/reparation activity, activin/inhibin activity, taxis/chemotaxis 
activity, blood coagulation and thrombus activity, receptor/Iigand activity, etc. and, therefore, is expected as useful in preventing and/or 
treating various diseases. 
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25 A^ins. ^^^-iLTIl or i Mis&ch, i&^lCcfc 0 ±32c 

D N A <D5£1%<Dfzl£>iD 7° D ^B— 5? — , 7 □ ^— ^ — GDfaJPH^fc <*: rt» <=> & -5 




5 



PCT/JPW02283 



WO 99/55863 



10 



15 



20 



25 



WO 99/55863 PCT/JP99/02283 



(#J*_«\ Metlilii, Leutt6ll) ^ObtlTUSo fol, 
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s « 9 t^k $ n & # u ^ y 9- f & 3 - h -r 5 -r ^xiE^jp^f? £ n 

15 #&-5„ 

(3) T^^nScDNAIl (2) c D N AC-ltlTf) 
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izvibizmnssT& (^a^rp^ 5,536,637 -^tcse©) 0ffigi:^uTi« 

+t- y ±1 □ a -t X • -t U t: -> (Saccharomyces cerevisiae) fa if CDg#§7^ -> 3 

25 iSfcli7 7 ^ y-xsx^K-i^^iii LTfiJffi-rsfc&izfii' > 



7 



PCT/JP99/02283 
WO 99/55863 

_**- 3 «t/'JVi*-*<=. -^•s^t-**'* 3 -*'"** 
M ^ ««>< 

(GENBANK accession No.V0131i; <eww 
SST^^-PSUC2*f»bft. 

* ,~ H- AAH 5^XSF (Gammerer, Methods in Enzymol. 

10 i 192-201 1*3> (ADH7D t -H ^ 

c !; E 1 o r i > 

^ - y^^n^ *4WftSH > yv \Jt / ^" 



20 



25 



8 



WO 99/55863 



PCT/JP99/02283 



1M h$l^lfe7>^^7^7-S:fflliTr^c DNA^ML, 
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300~800bp(Dc DNA^it^, 8$iJi II IS, fffH L J: -5 iz&M 
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2fcfg?f3<7)#SJj£#lte, *-/tJ^^Jl/y>HO-A (Carpal tunnel 

syndrome) . 43 cfcMlC7)EI£> & U teTO ^fIC7)^^(3 GvJjT'fo £ . l£«ttfc 

25 Wcte, ^^7^^HU-y^X^i;u^^'JT-<h[ff]^(Z^II^{c^<Slbn 
TU5t >IXf , J>^ (Sequestering) PI t> ^ * tl 5 . 
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*HaiJ; H -T ;U > ^ A £&15 L ft 'J > fit 4 ;^3t £Mc 

T5c7)T-ft;i/~>^Ate^*fC^T&£o SSJW* K^U > F >^HScf® 

«J fig <t&5o £fcU<-^<7)ft H^U >!i^-&£ft K'VJ Vch^^n 
25 E-*KU >teft >.7-^-775 'J-iTJjt/A-WtOl?! 
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25 Sfe^HJW^'J^y^- K£fljt>T, ^R«^7XX h-^XX 
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fiv>^ fata U , <£> 4 B#P B 1ig-*f£ (I« n/i 3 H £ jRiJ^f o C 

{5^^<£J&3S££te7>^-fc>XDNA (RNA) l:ioT, ^\)^y° 
*^©cDNA?:7'P-7^lTyxy$7^ (genomic) DNA 

a S & ^ ft £ D '> & ^ 4 T + 5^ M £ & 3 U Sfcj6iSi^.TWj:«& 

25 t)$)^)o 
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#58 (c cfc 3 # □ S# co tf> a #t »J t L ■ T It , M co 7Ki4 * # 7jc 

-co£?&M/»teu $^>i:WJ> Sitm ¥Ufc&K ^^fb^J 

15 2>tztr><Dm&<DMm 

MMM 1 : poly (A) + RNACDIES! 
20 T^X 18.5 BRbyE'bWU&WL&DTR I z o 1 £££g (TRIzol reagent, 3£^fS 
GIBCOBRL cb 0 m A) ^ffl^T^rRNA^ffltiSL, mRNAka'j7^ 
->3 > ■ + y h (mRNA Purification Kit, i&o n p£, Pharmacia «fc 0 18 A) 
^tpoly (A) + RNA^iLfc. 

25 «#J2:g$^SST cDNA^-T^^'J-Of^Sa 

±KC0p o 1 y (A) + RNA£g§IJiIX h o I mH^m^Ltz"7 > 9' 9 
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me r : 5 ■ -cGATTGAATTCTAGACCTGCCTCGAGM 

NNNNNNNN-3' (B*I»*H 1> *«' (7 - ilT ' ^ ^ 
^ h .^ x5l: ..>XfA (Superscript Plasmid System for cDNA Syn.nests 

and Plasmid Cloning, Sift*. GIBCOBRL* Jm^ 

9 ^--> 3 >*^ (DNA ligation k i,ve, 2 . «A«. SSffi «fl ^» 

« XhoIT-tmtU r*n-Xt»it3 0 0-8O0bP»cDN 
A ** 0 abT»«U PSUC2 (*«*« 5,536,637 ^#59) »EcoR 
, /N o t I WfcCWiU «tDH 1 0 B«dW^*l'-^/ 

- » c D N A 5 f 7" 5 • J - » ^ X 5 F * W S U , Sf K U ^ A* (Current 

Protocols to Molecular Biology 13.7.1 £*!«W-<*. 

mb MJ ^ h77> (Trp) ^«S*BS»«S« 

f.® 7**h7>- V-J'iK- (Accutran Replica Plater. SffiS. 

, slleicnerSScnuell^A) *^^n=- <^ 
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ULTci^. ^'<tH-.X' (Dynabeads, I^JtM. DYNAL <=fc D ISA) ^ffl^T 

i^SSe^J (D&fe IJDNAy-^r>>>^-+^ h (DNA Sequencing kit (Dye 
5 Terminator Cycle Sequencing Ready Reaction), (^pp4j. Applied Biosystems Inc. 

=k 0 ISA) ^fc'Mft¥l ^-^-^ X ;u > - 7 x > X &Tj5Jft 

£frfel*, SffijDN A^>— ^7X>-tt— 3 7 3 (Applied Biosystems lnc.)Ti%^-SX 

■?# <=> n/rJsasB^J4o cfc $ n s t ^ 7 kss^j (~ -3 u t ^ ■— & 1— x t 

5 5 7 D — ><Z)Bff)t c DNA (J£*T, A5 5 S S TSfrM* c D N A \Z 
^OTMc D N A cD 7 D — ->y«rl4^o S«^$n§7$ 7^SB?iJ 

£g£*n<7»7*7-;i^-7^ K^JtfcJrr-s da 5 5 s s TiH-c dn 

7^7X1 3 0 Jj&!E'frR c DNA7^7'7'J- (Uni-ZAP XR) (Stratagen cfc 
OmX) ©7 7-y|itf$MXL 1 -Blue MR F *tll:i^^tT^b 
20 n/il 0 07J7°7--7 : £t<n>/y7a'>l:h7>X7T-L/;o ' 3 ~Pm 
fSLTcV^XA 5 5 S S Tl^H-c DNA*"7°n— -"7*<hL T77-^/W 

^(D^CD 1 7°^ — 7 7^6 "7 t — >>'*f^U X7 XT-> X h • -\^/\°— • 
77-y (ExAssist helper phage. Stratagene <=t 0 IS A)<htttl AfliMX L 1 - 
2 5 Blue MRF* ^(Stratagene cfc <0 18 A ) iC $ ^ $ it , "7 r — > ; 5 H (pBluescript 
SK(-))i:fiLfc. 77-y$ KS:^IS1DH5 a II !l$ 
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A5 5 S ST^cDNA*lBW*8t^hMtt*. £ttS 

4~14«H«»«» 4 2 5757®. *fcttBW»4K»Sn*4 2 3T 

43 it/ FASTA i 0 . - 

ffltf.j ^ HOTS ;WE*C»UT BLASTX. BLASTP SitfFASTA U 

>B ->.t * ft <i«GF7r 5l J - 1 **** 

ft*. SfcBlASTX. BLASTP *^FASTA«. 7«A55 ? n-> (G 
*», 1 »7 5 V 1 ~ 4 4 8 (Swiss Pro, 

Access.cn HSU03S77 »7 5 / KKM > ~ 3 8 7 P,^*) 
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TUS (Beata Lecka-Czerniket. al. Mol.Cell.Biol.15 120-128 1995) „ ^b(C^-<Z) 

HM#J5 : ?^7A 5 5167^ y 7 - Aitfc^-OiM 
5 fe^M^^^-r SfefelZV^ V >c DNA7>7°U ^r-->3 > + 
7F (Marathon cDNA Amplification Kit, Clontech tti 0 StA)lZ 

5 ' RACE (Rapid Amplification of cDNA End)& 5:fflUT 5 '^ScDNA 

Ifi^Opo 1 y (A) + RNAJ:0^8!Lfc. 

10 ^rgO^SiH^JcDlf fgtcS^V^Ty^-r V— mA 5 5 -R 1 : 5 ' - CG 
TTTGTGCACTGCTGCTGTGCATTCC — 3 ' (B2?U#-^ 1 
2) ^SLT. l&^y hlZ^i^^tlfzT yy'^ — y'^^ 7-tTPCR$: 
ff&o/Co fi^n/ic DNA^T^P-xm^KjT^lli^, pGEM- 
T Vector 085 Promega ctOBIA) tCj^U AJllffiD H 5 a 

15 ^LT^Tsn^iil. 3r±&SBr^iJ£&/£Lfco ^OJS*, @r2?'J#^3 

tifc^D->©x^y> i ffi.ftm$4 o 0t&STfiStH#^E"r5S'J(7)x^y > 

^i 975y^i:ii$nt7i'V7*-Aia (ge^j#-^ 6 tc^T) £ 

25 5 Jm^\3 4 0-1614 fUj<7)fig*£) 4 2 57$ /g££fc ft, BE^JfHt 9 tC>K 
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pNotS (Kaufman e. al., Nucleic Acids Re ,19,4485-4490(1991)#!«) CSS* 
b 5 £6588(1175*3 H p N o t S -mA 5 5 a«*Ufc. 

GJBCOBRLiOBA) MW2 9 3Tfflja (ATCC CRL-1573 2 9 SUB 
rS V40 TSBS«AbfcSfflia*t)«AL, 19W»C"S-^* 
= > (Me t) «»W : Met7.J-oMCS*^3 03(3^»U 

r« Me t*«tnwrsi»iiiwi*fia^fc. mu»*b**. t> 

MJ 3 >-10 («««. AmiconiOBA) CT»1 OftEMIU SDS 

s n/-ae»®5sa*BAS2ooo (*±7-r;w») *fflv^!Hb&. 

s P N o t S -BA 5 5 SSAUfc 2 9 3 TIM«)1M±*1=B. 

na ^>W6 0~7 0kDa#fil:». 

t a Ttf X A 5 5 6 »»T*147 5 / ■«** 

s0 «A6 5«<H«48kD.i0t«<. ^A 5 5ieM2^ 
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thPDGF-BB (GENZYME *fc§3) 1, 3 &fzte 1 0 n g /m 1 £ m 
f^tC mtfcM 6 CO^rffiT p K o t S JfctJp N o t S -mA 5 5 $iXLfc 
5 2 9 3 T»CO±§*±tf ^^ri^t&ScO 1 0 %\Zt&Z>£.o iZffifinls, *fflM?it 
EL ISA, BrdU»7h (j§Jn n a£> ^- U >#-V >/W Ack 018 
A) CD^ritlL/i^oT, J(ilf¥il»B r dU©10^^i/IL^o 
-f-cZ^SJ^ HI 1 t^tct "5 C, 7 7hit™WttpNo t SCO^.^ 

mxLtz2 9 3 TMm<Dmm±m&muLtzm£;ttmmu(Dm&£&&L~c 

10 iiiT^of;^ p N o t S - mA 5 5 ^AL/i 2 9 3 T »l^i± 

ifcPDGFSl, 3, 1 0 n g/m 1 OlST^iDLT. iSftSWl: 

hik<g¥-mffiMMlZ&rtZ>B r d UcD^0&*£±#£l*Jt*§'&l;:i3l> 
Tfe, pNot S<Z)*£i*AL-£2 9 3 T*fflJ&<Z)ig#±m£RB$tI^n L /i 
15 tl^(i^iP(7)ti^-ii:bfeLT4ff^^T$,o^^\ pNotS-mA5 5£ 
mALfz2 9 3 T«(Dl§#±m^^PLfcti^-{C«WB/cCB r dUO&O 

£ W-T S d <h ^tU-otzo 

20 

@1Bi£t3cfc9£BELfc3«&<Z>^XA 5 SSP^^^H 
RTNPVYRGPYSNPYSTSYSG (7 1- 90) (BB#I#^ 1 
4 8- 6 7) 

25 GAYYIFQIKSGNEGREFYMR ( 3 7 6 - 3 9 5) (@n?iJ#^ 
103 5 3-3 7 2) 
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MTRP I KGPRD I QLDLEMI TVN (4 0 6 -4 2 6) (E*J# 
^- 1 60 3 8 3~4 0 3) 

6 S D S - P AG E 

le^r^j^^Wb^k'n^p (pvdf«, aa*. 
- oo ^ t * e> i# e> nfc# -J * □ - ^-^fr^ WAssia £W 

3ft bWS 31 <h#f6&2n&. 
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10 3. Ht^coigHmi mzmmzftfztfi) k^h- nt§ c dna„ 
4. @e?ij#-5§2, 5, 7 ^fctti o^fskzn&i&mffimfrh>uz>m3<(Dffim 

Wi3mmWi<D c DNA, ^fcte^E^JKg&WtCA^yU^-T Xt57 7 
> c DNA„ 

15 

c DNA, 

20 6. Ht>)<CO|gH^3^^^^5^CO^-rn^C7)^H|B®(Dc DNA^b/i;5 
25 

8. fl^otSKfg iri^/r(i^2 31tcfaiic$n/t# l J^-7°^- K£585t£-tt5 
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9 . ft*®«0* 1 2 SldE** *lfc# U ^^^^ D - 



10 



20 is. gt*o«B*ia*fctt«2acB«snfc#u^H*ffl^T, 
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A55igS±jS 
PDGF Ing/ml (PI) 
PDGF 3ng/ml (P3) 

PDGF lOng/ml (P10) 

+pi 

+P3 

+P10 
A55igg±jg +P1 

A55igg±;t +P3 

A55tgg±rS +P10 




0.1 0.2 0.3 04 0.5 0.6 
«W££(A450) 



0.7 



1 / 1 
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SEQUENCE LISTING 
<110> Ono Pharmaceutical Co.. Ltd. 

< 1 20> Novel polypept ides, cDNA coding these polypept ides and Use thereof 

<1 30 > ONF-2969PCT 
< 1 4 1 > 1999-04-28 

<150> IP 10-119731 
< 1 5 1 > 1998-04-28 

<160> 12 

<170> Patentln Ver. 2. 0 

<210> 1 

\21W 448 

<2 1 2 > PRT 

<213> Mus musculus 

<400> 1 

Met Pro Gly Leu Lys Arg He Leu Thr Val Thr He Leu Ala Leu Trp 
-23 -20 -1 5 -10 

Leu Pro His Pro Gly Asn Ala Gin Gin Gin Cys Thr Asn Gly Phe Asp 
-5 1 5 
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Leu Asp Arg Gin Ser Gly Gin Cys Leu Asp lie Asp Glu Cys Arg Thr 
10 15 20 25 

He Pro Glu Ala Cys Arg Gly Asp Met Met Cys Val Asn Gin Asn Gly 

30 35 40 

Gly Tyr Leu Cys He Pro Arg Thr Asn Pro Val Tyr Arg Gly Pro Tyr 

45 50 55 

Ser Asn Pro Tyr Ser Thr Ser Tyr Ser Gly Pro Tyr Pro Ala Ala Ala 

60 65 70 

Pro Pro Val Pro Ala Ser Asn Tyr Pro Thr lie Ser Arg Pro Leu Val 

75 80 85 

Cys Arg Phe Gly Tyr Gin Met Asp Glu Gly Asn Gin Cys Val Asp Val 
90 95 100 105 

Asp Glu Cys Ala Thr Asp Ser His Gin Cys Asn Pro Thr Gin He Cys 

110 115 120 

He Asn Thr Glu Gly Gly Tyr Thr Cys Ser Cys Thr Asp Gly Tyr Trp 

125 130 135 

Leu Leu Glu Gly Gin Cys Leu Asp He Asp Glu Cys Arg Tyr Gly Tyr 

140 145 150 

Cys Gin Gin Leu Cys Ala Asn Val Pro Gly Ser Tyr Ser Cys Thr Cys 

155 160 165 

Asn Pro Gly Phe Thr Leu Asn Asp Asp Gly Arg Ser Cys Gin Asp Val 
170 175 180 185 

Asn Glu Cys Glu Thr Glu Asn Pro Cys Val Gin Thr Cys Val Asn Thr 

190 195 200 

Tyr Gly Ser Phe He Cys Arg Cys Asp Pro Gly Tyr Glu Leu Glu Glu 
205 210 215 
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Asp Gly lie His Cys Ser Asp Met Asp G 1 u Cys Ser Phe Ser Glu Phe 

220 225 230 

Leu Cys Gin His Glu Cys Val Asn Gin Pro Gly Ser Tyr Phe Cys Ser 

235 240 245 

Cys Pro Pro Gly Tyr Val Leu Leu Asp Asp Asn Arg Ser Cys Gin Asp 
250 255 260 265 

lie Asn Glu Cys Glu His Arg Asn His Thr Cys Thr Ser Leu Gin Thr 

270 275 280 

Cys Tyr Asn Leu Gin Gly Gly Phe Lys Cys lie Asp Pro He Ser Cys 

285 290 295 

Glu Glu Pro Tyr Leu Leu He Gly Glu Asn Arg Cys Met Cys Pro Ala 

300 305 310 

Glu His Thr Ser Cys Arg Asp Gin Pro Phe Thr He Leu Tyr Arg Asp 

315 320 325 

Met Asp Val Val Ser Gly Arg Ser Val Pro Ala Asp He Phe Gin Met 
330 335 340 345 

Gin Ala Thr Thr Arg Tyr Pro Gly Ala Tyr Tyr He Phe Gin He Lys 

350 355 360 

Ser Gly Asn Glu Gly Arg Glu Phe Tyr Met Arg Gin Thr Gly Pro lie 

365 370 375 

Ser Ala Thr Leu Val Met Thr Arg Pro He Lys Gly Pro Arg Asp He 

380 385 390 

Gin Leu Asp Leu Glu Met He Thr Val Asn Thr Val lie Asn Phe Arg 

395 400 405 

Gly Ser Ser Val lie Arg Leu Arg He Tyr Val Ser Gin Tyr Pro Phe 
410 415 420 425 
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<210> 2 

<2 1 1 > 1344 

<21Z> DNA 

<21 3> Mus musculus 

















Q t (T/ 1 f 1 O CT CT O t 

a i gccagga i 


I dddddggd L 


dL I L dL I g L I 


n n r* n t r> t t CTCT 

dL L d I L L L gg 


cac tc tggc t 


tccacatcc t 


60 


gggaa tgcac 


1 rfP O (TP 1 fT t CT 

agcagcagi g 


L dL dddLggL 


t t t fTQ PP t CTCT 

I I I gdLL L gg 


accgccagtc 


aggacagtgt 


120 


^ t OfT'l t !1 t i CT 

L I dgd I d I L g 


n t rr q o \ ct r* r* ct 
d I gdd IgLLg 


fTn p p o t c* r* c t 
gdL L d I III I 


O" Q (TOT* t t CT T c 

gdggL I I g I L 


gtggggacat 


gatgtgtgtc 


180 


ddL L dead L g 


CTf* CTCtCT i Q t t t 

gCggg I d I I I 


or t ct r* o t />pr t 
g I gL d I L L L L 


c ct 'i n r* r» q q p r* 
L gddL L ddL L 


cagtgtatcg agggecttae 


240 


1 L ddd I C L L I 


dL L L I dL d I L 


P t i P Q (TOT* 

L L dL L L dggL 


r* q t ci r* r* p q or 
LLdldLLL dg 


cagcggcccc 


accagtacca 


300 


gL I I L L ddL L 


dlLll dL gd L 


L L L ddggL L I 


r*t t or f r t or* r* 
L L L g L L L gL L 


gctttgggta 


tcagatggat 


360 


gaaggcaacc 


agtgtgtgga 


tgtggacgag 


tgtgcaacag 


ac tcacacca 


gtgcaaccc t 


420 


acccagatc t 


gtatcaacac 


tgaaggaggt 


tacacc tgc t 


cc tgeacega 


tgggtactgg 480 


ct tctggaag 


ggcagtgcct 


agatat tgat 


gaatgtcgct 


atggt tactg 


ccagcagctc 


540 


tgtgcaaatg 


t tccaggat c 


c tat tec tgt 


acatgcaacc 


c tggt t tcac 


cc tcaacgac 


600 


gatggaaggt 


c t tgccaaga 


tgtgaacgag 


tgcgaaactg 


agaatccc tg 


tgt tcagacc 


660 


tgtgtcaaca 


cc tatggc tc 


1 1 teatctge 


cgctgtgacc 


caggatatga 


act tgaggaa 


720 


gatggcat tc 


ac tgcagtga 


tatggacgag 


tgeaget tct 


ccgagt tec t 


c tgtcaacac 


780 


gagtgtgtga 


accagccggg 


ctcatact tc 


tgctcgtgcc 


ctccaggcta 


cgtcctgt tg 


840 


gatgat aacc 


gaagctgcca 


ggatatcaat 


gaatgtgagc 


accgaaacca 


cacgtgtacc 


900 


tcac tgcaga 


c t tgc tacaa 


tc tacaaggg 


ggct tcaaat 


gtat tgatcc 


catcagc tgt 


960 


gaggagcct t 


atctgctgat 


tggtgaaaac 


cgc tgtatgi 


gtcc tgctga 


gcacaccagc 


1020 


tgcagagacc 


agccat tcac 


catcc tgtat 


egggacatgg atgtggtgtc 


aggaege tec 


1080 
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gttcctgctg acatcttcca gatgcaagca acaacccgat accctggtgc ctattacatt 1140 
ttccagatca aatctggcaa cgagggtcga gagttctata igcggcaaac agggcctatc 1200 
agtgccaccc tggtgatgac acgccccatc aaagggcctc gggacatcca gctggacttg 1260 
gagatgatca ctgtcaacac tgtcatcaac ttcagaggca gctccgtgat ccgactgcgg 1320 



<210> 3 
<2 1 1 > 2233 
<212> DNA 
<213'' Mus musculus 

<220> 

<223> Clone mouse A55 derived from Day 13 mouse embryonic heart 

<220> 

<221> CDS 

<222-' (75). . (1418) 

<220> 

\22\'y sig_peptide 
<222> (75). . (143) 

<22(T- 

<2 2 1 mat_pept ide 
<222> (144). . (1418) 



atatatgtgt cgcagtatcc gttc 



1344 
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<400> 3 

aattcggcac gagccccagt cccaccgcag agcctgcctt cctcgcgtcg cttctcctcc 60 

cgcgcatctt ggat atg cca gga tta aaa agg ata etc act gtt acc ate 110 

Met Pro Gly Leu Lys Arg He Leu Thr Val Thr He 

-20 -15 

ttg gca etc tgg ctt cca cat cct ggg aat gca cag cag cag tgc aca 158 

Leu Ala Leu Trp Leu Pro His Pro Gly Asn Ala Gin Gin Gin Cys Thr 

-10 -5 -11 5 

aac ggc ttt gac ctg gac cgc cag tea gga cag tgt eta gat att gat 206 

Asn Gly Phe Asp Leu Asp Arg Gin Ser Gly Gin Cys Leu Asp He Asp 

10 15 20 

gaa tgc egg acc ate cct gag get tgt cgt ggg gac atg atg tgt gtc 254 

Glu Cys Arg Thr He Pro Glu Ala Cys Arg Gly Asp Met Met Cys Val 

25 30 35 

aac cag aat ggc ggg tat ttg tgc ate cct cga acc aac cca gtg tat 302 

Asn Gin Asn Gly Gly Tyr Leu Cys He Pro Arg Thr Asn Pro Val Tyr 

40 45 50 

cga ggg cct tac tea aat ccc tac tct aca tec tac tea ggc cca tac 350 

Arg Gly Pro Tyr Ser Asn Pro Tyr Ser Thr Ser Tyr Ser Gly Pro Tyr 

55 60 65 

cca gca gcg gee cca cca gta cca get tec aac tac ccc acg an tea 398 

Pro Ala Ala Ala Pro Pro Val Pro Ala Ser Asn Tyr Pro Thr He Ser 

70 75 80 85 

agg cct ctt gtc tgc cgc ttt ggg tat cag atg gat gaa ggc aac cag 446 

Arg Pro Leu Val Cys Arg Phe Gly Tyr Gin Met Asp Glu Gly Asn Gin 
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tgt gtg gat gtg 
Cys Val Asp Val 
105 

acc cag ate tgt 
Thr Gin He Cys 
120 

gat ggg tac tgg 
Asp Gly Tyr Trp 
135 

cgc tat ggt tac 
Arg Tyr Gly Tyr 
150 

tec tgt aca tgc 
Ser Cys Thr Cys 

tgc caa gat gtg 
Cys Gin Asp Val 
185 

tgt gtc aac acc 
Cys Val Asn Thr 
200 

gaa ctt gag gaa 
Glu Leu Glu Glu 
215 

ttc tec gag ttc 



90 

gac gag tgt gca aca 
Asp Glu Cys Ala Thr 
110 

ate aac act gaa gga 
He Asn Thr Glu Gly 
125 

ctt ctg gaa ggg cag 
Leu Leu Glu Gly Gin 
140 

tgc cag cag etc tgt 
Cys Gin Gin Leu Cys 
155 

aac cct ggt ttc acc 
Asn Pro Gly Phe Thr 
170 

aac gag tgc gaa act 
Asn Glu Cys Glu Thr 
190 

tat ggc tct ttc ate 
Tyr Gly Ser Phe He 
205 

gat ggc att cac tgc 
Asp Gly He His Cys 
220 

etc tgt caa cac gag 



95 

gac tea cac cag 
Asp Ser His Gin 

ggt tac acc tgc 
Gly Tyr Thr Cys 
130 

tgc eta gat att 
Cys Leu Asp I le 
145 

gca aat gt t cca 
Ala Asn Val Pro 
160 

etc aac gac gat 
Leu Asn Asp Asp 
175 

gag aat ccc tgt 
Glu Asn Pro Cys 

tgc cgc tgt gac 
Cys Arg Cys Asp 
210 

agt gat atg gac 
Ser Asp Met Asp 
225 

tgt gtg aac cag 



100 

tgc aac cct 494 
Cys Asn Pro 
115 

tec tgc acc 542 
Ser Cys Thr 

gat gaa tgt 590 
Asp Glu Cys 

gga tec tat 638 
Gly Ser Tyr 
165 

gga agg tct 686 
Gly Arg Ser 
180 

gt t cag acc 734 
Val Gin Thr 
195 

cca gga tat 782 
Pro Gly Tyr 

gag tgc age 830 
Glu Cys Ser 

ccg ggc tea 878 
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Phe Ser Glu Phe 
230 

tac ttc tgc teg 
Tyr Phe Cys Ser 

age tgc cag gat 
Ser Cys Gin Asp 
265 

tea ctg cag act 
Ser Leu Gin Thr 
280 

ccc ate age tgt 
Pro He Ser Cys 
295 

atg tgt cct get 
Met Cys Pro Ala 
310 

ctg tat egg gac 
Leu Tyr Arg Asp 

ate ttc cag atg 
He Phe Gin Met 
345 

1 1 c cag ate aaa 
Phe Gin He Lys 
360 



Leu Cys Gin His Glu 
235 

tgc cct cca ggc tac 
Cys Pro Pro Gly Tyr 
250 

ate aat gaa tgt gag 
1 1 e Asn Glu Cys Glu 
270 

tgc tac aat eta caa 
Cys Tyr Asn Leu Gin 
285 

gag gag cct tat ctg 
Glu Glu Pro Tyr Leu 
300 

gag cac ace age tgc 
Glu His Thr Ser Cys 
315 

atg gat gtg gtg tea 
Met Asp Val Val Ser 
330 

caa gca aca acc cga 
Gin Ala Thr Thr Arg 

350 

tct ggc aac gag ggt 
Ser Gly Asn Glu Gly 
365 



Cys Val Asn Gin 
240 

gtc ctg t tg gat 

Val Leu Leu Asp 
255 

cac cga aac cac 

His Arg Asn His 

ggg ggc ttc aaa 
Gly Gly Phe Lys 
290 

ctg at t ggt gaa 
Leu He Gly Glu 
305 

aga gac cag cca 
Arg Asp Gin Pro 
320 

gga cgc tec gt t 
Gly Arg Ser Val 
335 

tac cct ggt gec 
Tyr Pro Gly Ala 

cga gag ttc tat 
Arg Glu Phe Tyr 
370 



Pro Gly Ser 
245 

gat aac cga 926 
Asp Asn Arg 
260 

acg tgt acc 974 
Thr Cys Thr 
275 

tgt att gat 1022 
Cys I le Asp 

aac cgc tgt 1070 
Asn Arg Cys 

t tc acc ate 1118 
Phe Thr He 

325 

cct get gac 1166 
Pro Ala Asp 
340 

tat tac att 1214 
Tyr Tyr He 
355 

atg egg caa 1262 
Met Arg Gin 
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aca ggg cct ate agt gec acc ctg gtg atg aca cgc ccc ate aaa ggg 1310 
Thr Gly Pro He Ser Ala Thr Leu Val Met Thr Arg Pro He Lys Gly 

375 380 385 

cct egg gac ate cag ctg gac ttg gag atg ate act gtc aac act gtc 1358 
Pro Arg Asp lie Gin Leu Asp Leu Glu Met He Thr Val Asn Thr Val 
390 395 400 405 

ate aac ttc aga ggc age tec gtg ate cga ctg egg ata tat gtg teg 1406 
He Asn Phe Arg Gly Ser Ser Val He Arg Leu Arg He Tyr Val Ser 

410 415 420 

cag tat ccg ttc tgagcctctg getaaggect ctgacactgc ctttcaccag 1458 
Gin Tyr Pro Phe 
425 

caccgaggga egggaggaga aaggaaacca gcaagaatga gagegagaca gaeattgeae 1518 
ctttcctgct gaatatctcc tgggggcatc agectagcat cttgacccat atctgtacta 1578 
ttgcagatgg tcactctgaa ggacaccctg ccctcagttc etatgatgea gttatccaaa 1638 
agtgttcatc ttagcccctg atatgaggtt gecagtgact cttcaaagcc ttccatttat 1698 
ttccatcgtt ttataaaaaa gaaaatagat tagatttget ggggtatgag tcctcgaagg 1758 
ttcaaaagac tgagtggctt gctctcacct cttcctctcc ttcctccatc tettgetgea 1818 
ttgctgcttt gcaaaagtcc tcatgggctc gtgggaaatg ctgggaatag etagtttget 1878 
tettgeatgt tc tgagaagg c tatgggaac acaccacagc aggatcgaag gt t t t tatag 1938 
agtctatttt aaaatcacat ctggtatttt cagcataaaa gaaattttag ttgtctttaa 1998 
aatttgtatg agtgtttaac cttttcttat tcattttgag gcttcttaaa gtggtagaat 2058 
tccttccaaa ggectcagat acatgttatg ttcagtcttt ccaacctcat cctttcctgc 2118 
atcttagccc agtttttacg aagacccctt aatcatgett tnttaagagt ttttacccaa 2178 
c tgcgt tgga agacagaggt atccagac tg at t aaataat tgaagaaaaa aaaaa 2233 
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•;210> 4 

\21P 423 

\212^ PRT 

<213.' Mus musculus 

<400~' 4 

Gin Cys Thr Asn Gly Phc Asp Leu Asp Arg Gin Ser Gly Gin Cys Leu 

15 10 15 

Asp lie Asp Glu Cys Arg Thr He Pro Glu Ala Cys Arg Gly Asp Met 

20 25 30 

Met Cys Val Asn Gin Asn Gly Gly Tyr Leu Cys lie Pro Arg Thr Asn 

35 40 45 

Pro Val Tyr Arg Gly Pro Tyr Ser Asn Pro Tyr Ser Thr Ser Tyr Ser 

50 55 60 

Gly Pro Tyr Pro Ala Ala Ala Pro Pro Val Pro Ala Ser Asn Tyr Pro 
65 70 75 80 

Thr He Ser Arg Pro Leu Val Cys Arg Phe Gly Tyr Gin Met Asp Glu 

85 90 95 

Gly Asn Gin Cys Val Asp Val Asp Glu Cys Ala Thr Asp Ser His Gin 

100 105 110 

Cys Asn Pro Thr Gin He Cys He Asn Thr Glu Gly Gly Tyr Thr Cys 

115 120 125 

Ser Cys Thr Asp Gly Tyr Trp Leu Leu Glu Gly Gin Cys Leu Asp He 

130 135 140 

Asp Glu Cys Arg Tyr Gly Tyr Cys Gin Gin Leu Cys Ala Asn Val Pro 
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145 150 155 160 

Gly Ser Tyr Ser Cys Thr Cys Asn Pro Gly Phe Thr Leu Asn Asp Asp 

165 170 175 

Gly Arg Ser Cys Gin Asp Val Asn Glu Cys Glu Thr Glu Asn Pro Cys 

180 185 190 

Val Gin Thr Cys Val Asn Thr Tyr Gly Ser Phe lie Cys Arg Cys Asp 

195 200 205 

Pro Gly Tyr Glu Leu Glu Glu Asp Gly lie His Cys Ser Asp Met Asp 

210 215 220 

Glu Cys Ser Phe Ser Glu Phe Leu Cys Gin His Glu Cys Val Asn Gin 
225 230 235 240 

Pro Gly Ser Tyr Phe Cys Ser Cys Pro Pro Gly Tyr Val Leu Leu Asp 

245 250 255 

Asp Asn Arg Ser Cys Gin Asp He Asn Glu Cys Glu His Arg Asn His 

260 265 270 

Thr Cys Thr Ser Leu Gin Thr Cys Tyr Asn Leu Gin Gly Gly Phe Lys 

275 280 285 

Cys lie Asp Pro lie Ser Cys Glu Glu Pro Tyr Leu Leu lie Gly Glu 

290 295 300 

Asn Arg Cys Met Cys Pro Ala Glu His Thr Ser Cys Arg Asp Gin Pro 
305 310 315 320 

Phe Thr He Leu Tyr Arg Asp Met Asp Val Val Ser Gly Arg Ser Val 

325 330 335 

Pro Ala Asp lie Phe Gin Met Gin Ala Thr Thr Arg Tyr Pro Gly Ala 

340 345 350 

Tyr Tyr He Phe Gin He Lys Ser Gly Asn Glu Gly Arg Glu Phe Tyr 
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355 360 365 

Met Arg Gin Thr Gly Pro lie Ser Ala Thr Leu Val Met Thr Arg Pro 

370 375 380 

He Lys Gly Pro Arg Asp He Gin Leu Asp Leu Glu Met He Thr Val 
385 390 395 400 

Asn Thr Val He Asn Phe Arg Gly Ser Ser Val He Arg Leu Arg lie 

405 410 415 

Tyr Val Ser Gin Tyr Pro Phe 
420 



<210> 5 

<2 1 1 > 1269 

<212> DNA 

<213> Mus musculus 

<400> 5 

cagtgcacaa acggctttga cctggaccgc 
tgccggacca tccctgaggc ttgtcgtggg 
tatttgtgca tccctcgaac caacccagtg 
acatcctact caggcccata cccagcagcg 
acgat t tcaa ggcctct tgt ctgccgcttt 
gtggatgtgg acgagtgtgc aacagactca 
aacactgaag gaggttacac ctgctcctgc 
tgcctagata ttgatgaatg tcgctatggt 
ggatcctatt cctgtacatg caaccctggt 



cagtcaggac agtgtctaga tattgatgaa 60 

gacatgatgt gtgtcaacca gaatggcggg 120 

tatcgagggc cttactcaaa tccctactct 180 

gccccaccag taccagcttc caactacccc 240 

gggtatcaga tggatgaagg caaccagtgt 300 

caccagtgca accctaccca gatcigtatc 360 

accgatgggt actggcttct ggaagggcag 420 

tactgccagc agctctgtgc aaatgttcca 480 

ttcaccctca acgacgatgg aaggtcttgc 540 
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caagatgtga acgagtgcga aactgagaat ccctgtgttc agacctgtgt caacacctat 600 
ggctctttca tctgccgctg tgacccagga tatgaactlg aggaagatgg cattcactgc 660 
agtgatatgg acgagtgcag cttctccgag ttcctctgtc aacacgagtg tgtgaaccag 720 
ccgggctcat acttctgctc gtgccctcca ggctacgtcc tgttggatga taaccgaagc 780 
tgccaggata tcaatgaatg tgagcaccga aaccacacgt gtacctcact gcagacttgc 840 
tacaatctac aagggggctt caaatgtatt gatcccatca gctgtgagga gccttatctg 900 
ctgattggtg aaaaccgctg tatgtgtcct gctgagcaca ccagctgcag agaccagcca 960 
ttcaccatcc tgtatcggga catggatgtg gtgtcaggac gc tccgt tec tgetgacate 1020 
ttccagatgc aagcaacaac ccgataccct ggtgcctatt acattttcca gatcaaatct 1080 
ggcaacgagg gtcgagagt t etatatgegg caaacagggc ctatcagtgc caccctggtg 1140 
atgacacgcc ccatcaaagg gcctcgggac atccagctgg acttggagat gatcactgtc 1200 
aacactgtca tcaacttcag aggcagctcc gtgatccgac tgeggatata tgtgtcgcag 1260 
tatcegtte 1269 

<210> 6 

<211> 461 

<212> PRT 

<213> Mus musculus 

<400> 6 

Met Gly Pro Arg Ser Phe Glu Pro Met His Ser Gly Leu Cys Arg Gin 

-35 -30 -25 

Arg Arg Met lie Leu Thr Val Thr lie Leu Ala Leu Trp Leu Pro His 
-20 -15 -10 -5 

Pro Gly Asn Ala Gin Gin Gin Cys Thr Asn Gly Phe Asp Leu Asp Arg 
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-II 5 10 

Gin Ser Gly Gin Cys Leu Asp He Asp Glu Cys Arg Thr He Pro Glu 

15 20 25 

Ala Cys Arg Gly Asp Met Met Cys Val Asn Gin Asn Gly Gly Tyr Leu 

30 35 40 

Cys He Pro Arg Thr Asn Pro Val Tyr Arg Gly Pro Tyr Ser Asn Pro 
45 50 55 60 

Tyr Ser Thr Ser Tyr Ser Gly Pro Tyr Pro Ala Ala Ala Pro Pro Val 

65 70 75 

Pro Ala Ser Asn Tyr Pro Thr He Ser Arg Pro Leu Val Cys Arg Phe 

80 85 90 

Gly Tyr Gin Met Asp Glu Gly Asn Gin Cys Val Asp Val Asp Glu Cys 

95 100 105 

Ala Thr Asp Ser His Gin Cys Asn Pro Thr Gin He Cys He Asn Thr 

110 115 120 

Glu Gly Gly Tyr Thr Cys Ser Cys Thr Asp Gly Tyr Trp Leu Leu Glu 
125 130 135 140 

Gly Gin Cys Leu Asp He Asp Glu Cys Arg Tyr Gly Tyr Cys Gin Gin 

145 150 155 

Leu Cys Ala Asn Val Pro Gly Ser Tyr Ser Cys Thr Cys Asn Pro Gly 

160 165 170 

Phe Thr Leu Asn Asp Asp Gly Arg Ser Cys Gin Asp Val Asn Glu Cys 

175 180 185 

Glu Thr Glu Asn Pro Cys Val Gin Thr Cys Val Asn Thr Tyr Gly Ser 

190 195 200 

Phe He Cys Arg Cys Asp Pro Gly Tyr Glu Leu Glu Glu Asp Gly He 
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205 210 215 220 

His Cys Ser Asp Met Asp Glu Cys Ser Phe Ser Glu Phe Leu Cys Gin 

225 230 235 

His Glu Cys Val Asn Gin Pro Gly Ser Tyr Phe Cys Ser Cys Pro Pro 

240 245 250 

Gly Tyr Val Leu Leu Asp Asp Asn Arg Ser Cys Gin Asp lie Asn Glu 

255 260 265 

Cys Glu His Arg Asn His Thr Cys Thr Ser Leu Gin Thr Cys Tyr Asn 

270 275 280 

Leu Gin Gly Gly Phe Lys Cys lie Asp Pro He Ser Cys Glu Glu Pro 
285 290 295 300 

Tyr Leu Leu He Gly Glu Asn Arg Cys Met Cys Pro Ala Glu His Thr 

305 310 315 

Ser Cys Arg Asp Gin Pro Phe Thr lie Leu Tyr Arg Asp Met Asp Val 

320 325 330 

Val Ser Gly Arg Ser Val Pro Ala Asp He Phe Gin Met Gin Ala Thr 

335 340 345 

Thr Arg Tyr Pro Gly Ala Tyr Tyr He Phe Gin He Lys Ser Gly Asn 

350 355 360 

Glu Gly Arg Glu Phe Tyr Met Arg Gin Thr Gly Pro He Ser Ala Thr 
365 370 375 380 

Leu Val Met Thr Arg Pro He Lys Gly Pro Arg Asp He Gin Leu Asp 

385 390 395 

Leu Glu Met He Thr Val Asn Thr Val He Asn Phe Arg Gly Ser Ser 

400 405 410 

Val He Arg Leu Arg He Tyr Val Ser Gin Tyr Pro Phe 
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415 



420 



425 



<210> 7 

<211> 1383 

<2 1 2> DNA 

< 2 1 3 > Mus musculus 



<400> 7 
















era a 2 1 1 t c era 


gccaatgcac 


agtggactct 


gcagacagag 


aegcatgata 


60 




c c a t r t t erer 


actctggct t 


ccacatcc tg 


ggaatgeaca 


gcagcagtgc 


120 


a r a aacererc t 


1 1 era r r t 


ccgccagtca ggacagtgtc 


tagatat tga 


tgaatgeegg 


180 






tggggacatg atgtgtgtca 


accagaatgg 


egggtat ttg 


240 


tgcatccctc 


gaaccaaccc 


agtgtatcga 


gggect tact 


caaatcccta 


c tc tacatcc 


300 


tactcaggcc 


catacccagc 


agcggcccca 


ccagtaccag 


ct tccaacta 


ccccacgat t 


360 


tcaaggcctc 


t tgtctgccg 


ct t tgggtat 


cagatggatg 


aaggcaacca 


gtgtgtggat 


420 


gtggacgagt 


gtgeaacaga 


ctcacaccag 


tgcaaccc ta 


cccagatctg 


tatcaacac t 


480 


gaaggaggt t 


acacctgctc 


c tgcaccgat 


gggtactggc 


t tctggaagg geagtgecta 


540 


gatat tgatg 


aatgtcgc ta 


tggt tactgc 


cagcagctct 


gtgcaaatgt 


tccaggatcc 


600 


tat tec tgta 


catgcaaccc 


tggt t tcacc 


ctcaacgacg 


atggaaggtc 


t tgecaagat 


660 


gtgaacgagt 


gcgaaac tga 


gaatccc tgt 


gt tcagacct 


gtgtcaacac 


c tatggctc t 


720 


t tcatctgcc 


gctgtgaccc 


aggatatgaa 


ct tgaggaag 


atggcat tea 


c tgcagtgat 


780 


atggacgagt 


gcagc t tc tc 


cgagt tcctc 


tgtcaacacg 


agtgtgtgaa 


ccagccgggc 


840 


tcatacttct 


gc tcgtgccc 


tccaggctac 


gtcctgt tgg 


atgataaccg aagctgccag 900 


gatatcaatg 


aatgtgagca 


ccgaaaccac 


acgtgtacc t 


cac tgeagae 


t tgc tacaat 


960 


ctacaagggg 


get tcaaatg 


tat tgatccc 


atcagctgtg aggagect ta 


tc tgc tgat t 


1020 
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ggtgaaaacc gctgtatgtg tcctgctgag 
atcctgtatc gggacatgga tgtggtgtca 
atgcaagcaa caacccgata ccctggtgcc 
gagggtcgag agttctatat gcggcaaaca 
cgccccatca aagggcctcg ggacatccag 
gtcatcaact tcagaggcag ctccgtgatc 
ttc 



cacaccagct gcagagacca gccattcacc 1080 
ggacgctccg ttcctgctga catcttccag 1140 
tattacattt tccagatcaa atctggcaac 1200 
gggcctatca gtgccaccct ggtgatgaca 1260 
ctggacttgg agatgatcac tgtcaacact 1320 
cgactgcgga tatatgtgtc gcagtatccg 1380 

1383 



<210> 8 

<2 1 1 > 2429 

<212> DNA 

<213> Mus musculus 

<220> 

<223> Clone mouse A55b derived from Day 13 mouse embryonic heart 

<220^> 

<221> CDS 

<222> (232). . (1614) 
<220> 

<22P sig_peptide 

<222 > (232). . (339) 

<220;> 
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''221 / mat_pept ide 
•■.'222/' (340). . (1 614) 

*'400> 8 

cagcatctcg agagaggcag cagacaaccl ctctaggtca tttctctttc tttttggaaa 60 
gggcagcaac gttgtgcgca gtttataaaa tatcacacta catgtttttt aaatttggga 120 
gactgctgac tacggcacca gcaattgctt tgctgcgacg gctgtgagac aagcagaagt 180 
ctccgaacac ttctgtctgc gtttgctcta tgtgtgtgat ttacagaggg a atg gga 237 

Met Gly 
-35 

cct aga agt ttc gag cca atg cac agt gga etc tgc aga cag aga cgc 285 
Pro Arg Scr Pne Glu Pro Met His Ser Gly Leu Cys Arg Gin Arg Arg 

-30 -25 -20 

atg ata etc act gtt acc ate ttg gca etc tgg ctt cca cat cct ggg 333 
Met lie Leu Thr Val Thr lie Leu Ala Leu Trp Leu Pro His Pro Gly 

-15 -10 -5 

aat gca cag cag cag tgc aca aac ggc ttt gac ctg gac cgc cag tea 381 
Asn Ala Gin Gin Gin Cys Thr Asn Gly Phe Asp Leu Asp Arg Gin Ser 

-11 5 10 

gga cag tgt eta gat att gat gaa tgc egg acc ate cct gag get tgt 429 
Gly Gin Cys Leu Asp He Asp Glu Cys Arg Thr He Pro Glu Ala Cys 
15 20 25 30 

cgt ggg gac atg atg tgt gtc aac cag aat ggc ggg tat ttg tgc ate 477 
Arg Gly Asp Met Met Cys Val Asn Gin Asn Gly Gly Tyr Leu Cys lie 

35 40 45 

cct cga acc aac cca gtg tat cga ggg cct tac tea aat ccc tac let 525 
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Pro Arg Thr Asn Pro Val Tyr Arg Gly Pro Tyr Ser Asn Pro Tyr Ser 

50 55 60 

aca tec tac tea ggc cca tac cca gca gcg gec cca cca gta cca get 573 
Thr Ser Tyr Ser Gly Pro Tyr Pro Ala Ala Ala Pro Pro Val Pro Ala 

65 70 75 

tec aac tac ccc acg att tea agg cct ctt gtc tgc cgc ttt ggg tat 621 

Ser Asn Tyr Pro Thr lie Ser Arg Pro Leu Val Cys Arg Phe Gly Tyr 

80 85 90 

cag atg gat gaa ggc aac cag tgt gtg gat gtg gac gag tgt gca aca 669 

Gin Met Asp Glu Gly Asn Gin Cys Val Asp Val Asp Glu Cys Ala Thr 

95 100 105 110 

gac tea cac cag tgc aac cct ace cag ate tgt ate aac act gaa gga 717 

Asp Ser His Gin Cys Asn Pro Thr Gin He Cys lie Asn Thr Glu Gly 

115 120 125 

ggt tac acc tgc tec tgc ace gat ggg tac tgg ctt ctg gaa ggg cag 765 
Gly Tyr Thr Cys Ser Cys Thr Asp Gly Tyr Trp Leu Leu Glu Gly Gin 

130 135 140 

tgc eta gat att gat gaa tgt cgc tat ggt tac tgc cag cag etc tgt 813 

Cys Leu Asp lie Asp Glu Cys Arg Tyr Gly Tyr Cys Gin Gin Leu Cys 

145 150 155 

gca aat gtt cca gga tec tat tec tgt aca tgc aac cct ggt ttc acc 861 

Ala Asn Val Pro Gly Ser Tyr Ser Cys Thr Cys Asn Pro Gly Phe Thr 

160 165 170 

etc aac gac gat gga agg tct tgc caa gat gtg aac gag tgc gaa act 909 

Leu Asn Asp Asp Gly Arg Ser Cys Gin Asp Val Asn Glu Cys Glu Thr 

175 180 185 190 
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gag aat ccc tgt gtt cag acc tgt gtc aac acc tat ggc tct ttc ate 957 
Glu Asn Pro Cys Val Gin Thr Cys Val Asn Thr Tyr Gly Ser Phe He 

195 200 205 

tgc cgc tgt gac cca gga tat gaa ctt gag gaa gat ggc att cac tgc 1005 
Cys Arg Cys Asp Pro Gly Tyr Glu Leu Glu Glu Asp Gly lie His Cys 

210 215 220 

agt gat atg gac gag tgc age ttc tec gag ttc etc tgt caa cac gag 1053 
Ser Asp Met Asp Glu Cys Ser Phe Ser Glu Phe Leu Cys Gin His Glu 

225 230 235 

tgt gtg aac cag ccg ggc tea tac ttc tgc teg tgc cct cca ggc tac 1101 
Cys Val Asn Gin Pro Gly Ser Tyr Phe Cys Ser Cys Pro Pro Gly Tyr 

240 245 250 

gtc ctg ttg gat gat aac cga age tgc cag gat ate aat gaa tgt gag 1149 
Val Leu Leu Asp Asp Asn Arg Ser Cys Gin Asp He Asn Glu Cys Glu 
255 260 265 270 

cac cga aac cac acg tgt acc tea ctg cag act tgc tac aat eta caa 1197 
His Arg Asn His Thr Cys Thr Ser Leu Gin Thr Cys Tyr Asn Leu Gin 

275 280 285 

ggg ggc ttc aaa tgt att gat ccc ate age tgt gag gag cct tat ctg 1245 
Gly Gly Phe Lys Cys lie Asp Pro He Ser Cys Glu Glu Pro Tyr Leu 

290 295 300 

ctg att ggt gaa aac cgc tgt atg tgt cct get gag cac acc age tgc 1293 
Leu He Gly Glu Asn Arg Cys Met Cys Pro Ala Glu His Thr Ser Cys 

305 310 315 

aga gac cag cca ttc acc ate ctg tat egg gac atg gat gtg gtg tea 1341 
Arg Asp Gin Pro Phe Thr lie Leu Tyr Arg Asp Met Asp Val Val Ser 
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320 325 330 

gga cgc tec gtt cct get gac ate tic cag atg caa gca aca acc cga 1389 
Gly Arg Ser Vat Pro Ala Asp He Phe Gin Met Gin Ala Thr Thr Arg 
335 340 345 350 

tac cct ggt gee tat tac att ttc cag ate aaa tct ggc aac gag ggt 1437 
Tyr Pro Gly Ala Tyr Tyr He Phe Gin He Lys Ser Gly Asn Glu Gly 

355 360 365 

cga gag ttc tat atg egg caa aca ggg cct ate agt gec acc ctg gtg 1485 
Arg Glu Phe Tyr Met Arg Gin Thr Gly Pro He Ser Ala Thr Leu Val 

370 375 380 

atg aca cgc ccc ate aaa ggg cct egg gac ate cag ctg gac ttg gag 1533 
Met Thr Arg Pro He Lys Gly Pro Arg Asp He Gin Leu Asp Leu Glu 

385 390 395 

atg ate act gtc aac act gtc ate aac ttc aga ggc age tec gtg ate 1581 
Met lie Thr Val Asn Thr Val lie Asn Phe Arg Gly Ser Ser Val He 

400 405 410 

cga ctg egg ata tat gtg teg cag tat ccg ttc tgagcctctg getaaggect 1634 
Arg Leu Arg He Tyr Val Ser Gin Tyr Pro Phe 
41 5 420 425 

ctgacactgc ctttcaccag caccgaggga egggaggaga aaggaaacca gcaagaatga 1694 
gagegagaca gacat tgcac ctttcctgct gaatatctcc tgggggcatc agectagcai 1754 
cttgacccat atctgtacta ttgcagatgg tcactctgaa ggacaccctg ccctcagttc 1814 
etatgatgea gttatccaaa agigttcatc ttagcccctg atatgaggtt gecagtgact 1874 
cttcaaagcc ttccatttat ttccatcgtt ttataaaaaa gaaaatagat tagatttget 1934 
ggggtatgag tcctcgaagg ttcaaaagac tgagtggctt gctctcacct cttcctctcc 1994 
ttcctccatc tettgetgea ttgctgcttt gcaaaagtcc tcatgggctc gtgggaaatg 2054 



21/26 



WO 99/55863 PCT/JP99/02283 



ctgggaatag ctagtttgct tcttgcatgt tctgagaagg ctatgggaac acaccacagc 2114 
aggatcgaag gtttttatag agtctatttt aaaatcacat ctggtatttt cagcataaaa 2174 
gaaattttag ttgtctttaa aatttgtatg agtgtttaac cttttcttat tcattttgag 2234 
gcttcttaaa gtggtagaat tccttccaaa ggcctcagat acatgt tatg t tcagtc t t t 2294 
ccaacctcat cctttcctgc atcttagccc agtttttacg aagacccctt aatcatgctt 2354 
tnttaagagt t t t tacccaa ctgcgt tgga agacagaggt atccagac tg at taaataat 2414 
tgaagaaaaa aaaaa 2429 



<210> 9 

<211> 423 

<212 N > PRT 

<213> Mus musculus 



<400> 9 

Gin Cys Thr Asn Gly Phe Asp Leu 

1 5 
Asp lie Asp Glu Cys Arg Thr He 
20 

Met Cys Val Asn Gin Asn Gly Gly 
35 40 
Pro Val Tyr Arg Gly Pro Tyr Ser 

50 55 
Gly Pro Tyr Pro Ala Ala Ala Pro 

65 70 
Thr lie Ser Arg Pro Leu Val Cys 



Asp Arg Gin Ser Gly Gin Cys Leu 

10 15 
Pro Glu Ala Cys Arg Gly Asp Met 

25 30 
Tyr Leu Cys He Pro Arg Thr Asn 
45 

Asn Pro Tyr Ser Thr Ser Tyr Ser 
60 

Pro Val Pro Ala Ser Asn Tyr Pro 
75 80 
Arg Phe Gly Tyr Gin Met Asp Glu 
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85 90 95 

Gly Asn Gin Cys Val Asp Val Asp Glu Cys Ala Thr Asp Ser His Gin 

100 105 110 

Cys Asn Pro Thr Gin He Cys He Asn Thr Glu Gly Gly Tyr Thr Cys 

115 120 125 

Ser Cys Thr Asp Gly Tyr Trp Leu Leu Glu Gly Gin Cys Leu Asp lie 

130 135 140 

Asp Glu Cys Arg Tyr Gly Tyr Cys Gin Gin Leu Cys Ala Asn Val Pro 
145 150 155 160 

Gly Ser Tyr Ser Cys Thr Cys Asn Pro Gly Phe Thr Leu Asn Asp Asp 

165 170 175 

Gly Arg Ser Cys Gin Asp Val Asn Glu Cys Glu Thr Glu Asn Pro Cys 

180 185 190 

Val Gin Thr Cys Val Asn Thr Tyr Gly Ser Phe He Cys Arg Cys Asp 

195 200 205 

Pro Gly Tyr Glu Leu Glu Glu Asp Gly He His Cys Ser Asp Mel Asp 

210 215 220 

Glu Cys Ser Phe Ser Glu Phe Leu Cys Gin His Glu Cys Val Asn Gin 
225 230 235 240 

Pro Gly Ser Tyr Phe Cys Ser Cys Pro Pro Gly Tyr Val Leu Leu Asp 

245 250 255 

Asp Asn Arg Ser Cys Gin Asp He Asn Glu Cys Glu His Arg Asn His 

260 265 270 

Thr Cys Thr Ser Leu Gin Thr Cys Tyr Asn Leu Gin Gly Gly Phe Lys 

275 280 285 

Cys He Asp Pro lie Ser Cys Glu Glu Pro Tyr Leu Leu lie Gly Glu 
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290 295 300 

Asn Arg Cys Mel Cys Pro Ala Glu His Thr Scr Cys Arg Asp Gin Pro 
305 310 315 320 

Phe Thr He Leu Tyr Arg Asp Met Asp Val Val Ser Gly Arg Ser Val 

325 330 335 

Pro Ala Asp He Phe Gin Met Gin Ala Thr Thr Arg Tyr Pro Gly Ala 

340 345 350 

Tyr Tyr He Phe Gin He Lys Ser Gly Asn Glu Gly Arg Glu Phe Tyr 

355 360 365 

Met Arg Gin Thr Gly Pro He Ser Ala Thr Leu Val Met Thr Arg Pro 

370 375 380 

lie Lys Gly Pro Arg Asp He Gin Leu Asp Leu Glu Met He Thr Val 
385 390 395 400 

Asn Thr Val He Asn Phe Arg Gly Ser Ser Val He Arg Leu Arg He 

405 410 415 

Tyr Val Ser Gin Tyr Pro Phe 
420 



<210> 10 
<2H> 1269 
<212> DNA 
<213> Mus musculus 

<400> 10 

cagtgcacaa acggctttga cctggaccgc cagtcaggac agtgtctaga tattgatgaa 60 
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tgccggacca 


i ccc t gaggc 


t t rr t rr t rr rx rx 

t igtcgtggg 


i a t 1 i g t gc a 


t ccc i cgaac 


caacccagig 


aca t cc t ac t 


caggccca t a 


cccagcagcg 


acga 1 1 1 c aa 


ggcetc t tgt 


c tgeege 1 1 1 


gtggatgtgg 


acgagtgtgc 


aacagac tea 


aacac tgaag 


gaggt tacac 


c tgc t cc tgc 


tgcc tagata 


t tgatgaatg 


tege tatggt 


ggatcc tat t 


cc tgtacatg 


caaccc tggt 


caagatgtga 


aegagtgega 


aac tgagaat 


ggc tc 1 1 1 ca 


tctgccgctg 


tgacccagga 


agtgat atgg 


acgagtgcag 


c t tc tccgag 


ccgggc teat 


ac 1 1 c tgc t c 


gtgece t cca 


igccaggata 


tcaatgaatg 


tgagcaccga 


i a c a a t c t a c 


aagggggc 1 1 


caaa tgt a 1 1 


c tgai tggtg 


aaaacege tg 


tatgtgtcc t 


t tcaccatcc 


tgtatcggga 


catggatgtg 


t tecagatge 


aagcaacaac 


ccgataccc t 


ggcaacgagg 


gtcgagagt t 


etatatgegg 


atgacacgcc 


ccatcaaagg 


gcctcgggac 


aacac tgtca 


tcaact tcag 


aggcagc tec 


tatccgt tc 







<21(T' 11 
\211> 35 
\212-> DNA 

<213 x > Artificial Sequence 
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gacatgatgt gtgtcaacca gaatggcggg 120 
tatcgagggc cttactcaaa tccctactct 180 
gccccaccag taccagcttc caactacccc 240 
gggtatcaga tggatgaagg caaccagtgt 300 
caccagtgca accctaccca gatctgtatc 360 
accgatgggt actggcttct ggaagggcag 420 
tactgccagc agctctgtgc aaatgttcca 480 
ttcaccctca acgacgatgg aaggtcttgc 540 
ccctgtgttc agacctgtgt caacacctat 600 
tatgaacttg aggaagatgg cattcactgc 660 
ttcctctgtc aacacgagtg tgtgaaccag 720 
ggctacgtcc tgttggatga taaccgaagc 780 
aaccacacgt gtacctcact geagacttge 840 
gatcccatca gctgtgagga gecttatctg 900 
gctgagcaca ccagctgcag agaccagcca 960 
gtgtcaggac gctccgttcc tgetgacate 1020 
ggtgcctatt acat t t tcca gatcaaatc t 1080 
caaacagggc ctatcagtgc caccctggtg 1140 
atccagctgg acttggagat gatcactgtc 1200 
gtgatccgac tgeggatata tgtgtcgcag 1260 

1269 
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<220'.» 

<223> Description of Artificial Sequence:Primer 
<400 > 1 1 

cgattgaati ctagacctgc cicgagnnnn nnnnn 35 

<210> 12 
<211> 27 
<2 1 2 > DNA 

<213> Artificial Sequence 
<220^ 

<223> Description of Artificial Sequence :mA55 Rl primer 
<400> 12 

cgtttgtgca ctgctgctgt gcattcc 27 
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Document Name: Specification 
Title of the Invention: 

A novel polypeptide, a method for preparation of it, a cDNA encoding 
the polypeptide 
Claims 

1. Substantially purified form of the polypeptide that comprising 
the amino-acid sequence shown in SEQ ID NO. 1,4,6, 9, 11 or 14, homologue 
thereof, fragment thereof or homologue of the fragment. 

2. A polypeptide according to claim 1 that comprising the 
amino-acid sequence shown in SEQ ID NO. 1, 4, 6, 9, 11 or 14. 

3. A cDNA encoding the polypeptide according to claim 1. 

4. A cDNA according to claim 3 that comprising the nucleotide 
sequence shown in SEQ ID NO. 2, 5, 7, 10, 12 or 15 or a fragment cDNA selectively 
hybridized to the cDNA. 

5. A cDNA according to claim 3 that comprising the nucleotide 
sequence shown in SEQ ID NO. 3, 8 or 1 3 or a fragment cDNA selectively 
hybridized to the cDNA. 

6. A replication or expression vector carrying the cDNA according 
to claim 3 to 5 . 

7- A host cell transformed with the replication or expression 

vector according to claim 6. 

8. A method for producing the polypeptide according to claim 1 or 
2 which comprises culturing a host cell according to claim 7 under a condition 
effective to express the polypeptide according to claim 1 or 2 . 

9. A monoclonal or polyclonal antibody against the polypeptide 
according to claim 1 or 2. 

10. A pharmaceutical composition containing the polypeptide 
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according to claim 1 or 2 or the antibody according to claim 9 , in association 
with pharmaceutical^ acceptable diluent and/or carrier. 

11. A pharmaceutical composition for the treatment of abnormal 
growth of smooth muscle cell, containing a polypeptide according to claim 
1 or 2, in association with a pharmaceut ical ly acceptable diluent and/or 
carrier . 

12. A pharmaceutical composition for the treatment of 
arteriosclerosis, restenosis after PTCA or myosarcoma, containing the 
polypeptide according to claim 1 or 2 , in association with a pharmaceut ical ly 
acceptable diluent and/or carrier. 

13. A screening method for an antagonist or agonist of the 
polypeptide according to claim 1 or 2 with using the said polypeptide. 



Detailed Description of the Invention 
Field of the Invention 

The present invention provides a novel polypeptide, a method for 
preparation of them, a cDNA encoding the polypeptide, a vector containing 
it, a host cell transformed with the vector, an antibody of the peptide, 
a pharmaceutical composition containing the polypeptide or the antibody, 
a screening method with using the polypeptide. 

Problems to be Solved 

The present inventors investigated to find novel factors 
(polypeptides) which are useful for study or for the treatment or diagnosis 
of diseases induced by abnormal proliferation of smooth muscle. Especially, 
we had aimed secreted proteins and membrane proteins which have signal 
sequences for secretion. 

Background of the Invention 

In modern medical research, cardiovascular biology is a field that 
attracts considerable attention because cardiovascular disease is the 
leading cause of mortality. Cardiovascular research has revealed important 
facts about neointimal formation and arterial remodeling, both of which are 
thought to contribute to plaque formation in atherosclerosis and blood vessel 
narrowing. For example, there are three aspects of the cellular process in 
hypercholes terolaemia induced blood vessel damage in animal models that mimic 
human development of arteriosclerotic coronary disease. The three elements 
that form lesions on the artery wall are: a) proliferation of smooth muscle 
cells, macrophages and lymphocytes, b) formation of connective tissues 
(mainly elastic fiber proteins, collagen and proteoglycans made by smooth 
muscle cells in a process similar to scar formation), and c) the accumulation 
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of lipid and cholesterol in the newly formed connective tissue matrices. 
The exact sequence of the three damaging elements are debatable, but it is 
clear that the abnormal dedi f f erentiat ion , redi f f erentiation and growth of 
smooth muscle cells contribute structurally to vessel damage. Moreover, 
another significant pathological process that involves abnormal smooth 
muscle cell growth is restenosis after Percutaneous transluminal coronary 
angioplasty (PTCA) . 

The present inventors made reasonable efforts, by isolation of the 
molecules related to participation of smooth muscle cells in angiogenesis , 
for the aim to utilize them for regulation of abnormal proliferation of smooth 
muscle cells such like described above. 

In order to obtain a certain polypeptide or cDNA coding for the same, 
there has been generally employed a method composed of detecting the aimed 
biological activity in a tissue or a cell culture medium, then identifying 
a polypeptide as substance of the activity through the isolation and 
purification and isolating a gene encoding the polypeptide or 
expression-cloning method to isolate a gene by access of the biological 
activity of the polypeptide encoded by it. 

Because in many cases, however, physiologically active polypeptides 
have various biological activities, when taking the method to approaches 
based on a certain activity to isolate a gene, it has increasingly been 
happened that the gene is turned out to be identical to a known gene which 
has another activity after spending much efforts to isolate it. And because, 
in many cases, biological factors are produced only in a very slight amount 
or only in a specific condition, it is often made difficult to isolate and 
purify a factor and detect its biological activity. 

Related Arts 
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Recent rapid developments in techniques for constructing cDNAs and 
sequencing techniques have made it possible to quickly sequence a large amount 
of cDNAs . By utilizing these techniques, a process, which comprises 
constructing cDNAs at random, identifying the nucleotide sequences thereof, 
expressing novel polypeptides encoded by them, is now in progress. Although 
this process is advantageous in that a gene can be cloned and information 
regarding its nucleotide sequence can be obtained without any biochemical 
or genetic analysis, the target gene can be discovered thereby only 
accidentally in many cases. 

Means for solving the problems 

The present inventors have studied cloning method of genes coding 
proliferation and/or differentiation factors functioning in hematopoietic 
systems and immune systems. Focusing their attention on the fact that most 
of the secretory proteins such as proliferation and/or differentiation 
factors (for example various cytokines) and membrane proteins such as 
receptors thereof (hereafter these proteins will be referred to generally 
as secretory proteins and the like) have sequences called signal peptides 
in the N-termini, the inventors conducted extensive studies on a process 
for efficiently and selectively cloning a gene coding for a signal peptide. 
Finally, we have successfully invented a screening method for cDNAs having 
sequence encoding signal peptides, we called the method as signal sequence 
trap (SST) (Japanese Patent Application No. 6-13951). 

We also developed yeast SST method on the same concept. By the method using 
yeast, genes including sequence encoding signal peptide can be identified 
more easily and effectively (USP No. 5,536,637). 

By using the present method, the present inventors identified novel 
secreted protein produced by mouse embryonic heart and human kidney and a 





cDNA fragments encoding them, and by using the sequence information of the 
cDNA fragments they isolated each full-length cDNA from mouse embryonic heart 
and human kidney. And they discovered that the polypeptides had functions 
to suppress smooth muscle cells. 

The present cDNA sequence was identified as a clone mouse A55 and 
isolated from cDNA library derived from mouse embryonic heart based on genetic 
information obtained by using the Yeast SST method described above . The clone , 
mouse A55 is a full-length cDNA encoding a secreted polypeptide (which is 
called mouse A55 polypeptide here). 

The present cDNA sequence was named as a clone human A55 and isolated 
from cDNA library derived from human brain based on genetic information 
obtained from human kidney by using the Yeast SST method described above. 
The clone, human A55 is a full-length cDNA encoding a secreted polypeptide 
(which is called human A55 polypeptide here). 

There was no DNA sequence which is identical to that of mouse and human 
A55 of the present invention, when DNA sequence of mouse and human A55 were 
compared with data base by BLASTN and FASTA. And there was no polypeptides 
which is identical to that of mouse and human A55 of the present invention, 
when amino acid sequence of mouse and human A55 was compared with data base 
by BLASTX, BLASTP and FASTA. So the polypeptides of the present invention 
are considered to be novel . 

The inventors discovered that the polypeptides had functions to suppress 
smooth muscle cells. Accordingly, the polypeptides may be useful for 
treatment of diseases related to abnormal proliferation of smooth muscle 
cells, for example, arteriosclerotic coronary disease , neointimal formation 
which results in restenosis after percutaneous transluminal coronary 
angioplasty and myosarcoma. 
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Constitution of the Invention 

The present invention provides: 
1 ) a polypeptide having an amino acid sequence shown in SEQ 

ID NO. 1 , 4 , 6 , 9 , 11 or 14 , 

2) a cDNA encoding the polypeptide described above (1), 

3) a cDNA having an nucleotide sequence shown in SEQ ID NO. 2, 5, 7, 10, 
12 or 15, 

4) a cDNA that consists of an nucleotide sequence shown in SEQ ID NO. 
3 , 8 or 13 . 

Detailed Discliption 

The present invention is concerned with a polypeptide that consists 
of the amino acid sequence shown in SEQ ID NO. 1, 4, 6, 9, 11 or 14 in 
substantially purified form, a homologue thereof, a fragment of the sequence 
and a homologue of the fragment . 

Further, the present invention is concerned with a cDNA encoding the 
above peptides. More particularly the present invention is provided cDNA 
having the nucleotide sequence shown in SEQ ID NO. 2, 5, 7, 10, 12 or 15, 
and cDNA containing a fragment which is selectively hybridizing to the cDNA 
that consists of nucleotide sequence shown in SEQ ID NO. 2, 5, 7, 10, 12 
or 15. Complementary sequence of the above nucleotide sequence is also 
included in cDNA selectively hybridized. Hybridization are performed in an 
stringent condition. 

A polypeptide comprising amino acid sequence shown in SEQ ID NO. 1, 
4, 6, 9, 11 or 14 in substantially purified form will generally comprise 
the polypeptide in a preparation in which more than 90%, e.g. 95%, 98% cr 
99% of the polypeptide in the preparation is that of the SEQ ID NO. 1, A, 
6 , 9 , 11 or 14 . 
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A homologue of polypeptide comprising amino acid sequence shown in 
SEQ ID NO. 1, 4, 6, 9, 11 or 14 will be generally at least 70%, preferably 
at least 80 or 90% and more preferably at least 95% homologous to the 
polypeptide of SEQ ID NO. 1 over a region of at least 20, preferably at least 
30, for instance 40, 6 0 or 1 0 0 more cont iguous amino ac ids . Such a polypeptide 
homologue will be referred to a polypeptide of the present invention. 

Generally, a fragment of polypeptide comprising amino acid sequence 
shown in SEQ ID NO. 1, 4, 6, 9, 11 or 14 or its homologues will be at least 
10, preferably at least 15, for example 20 , 25, 30 , 40 , 50 or 60 amino acids 
in length, and are also referred to by the term "a polypeptide of the present 
invention" . 

A cDNA capable of selectively hybridizing to the cDNA comprising 
nucleotide sequence shown in SEQ ID NO. 2,5,7, 10, 12 or 15 will be generally 
at least 70%, preferably at least 80 or 90% and more preferably at least 
95% homologous to the cDNA of SEQ ID NO. 2, 5, 7, 10, 12 or 15 over a region 
of at least 20, preferably at least 30, for instance 40, 60 or 100 or more 
contiguous nucleotides. Such cDNA will be referred to M a cDNA of the present 
invention" . 

Fragments of the cDNA comprising nucleotide sequence shown in SEQ ID 
NO. 2, 5, 7, 10, 12 or 15 will be at least 10, preferably at least 15, for 
example 20 , 25 , 30 or 40 nucleotides in length, and will be also referred 
to "a cDNA of the present invention" as used herein. 

A further embodiment of the present invention provides replication 
and expression vectors carrying cDNA of the invention. The vectors may be, 
for example, plasmid, virus or phage vectors provided with an origin of 
replication, optionally a promoter for the expression of the said cDNA and 
optionally a regulator of the promoter. The vector may contain one or more 
selectable marker genes, for example a ampicillin resistance gene. The 



vector may be used in vitro, for example of the production of RNA corresponding 
to the cDNA, or used to transfect or transfect a host cell. 

A further embodiment of the present invention provides host cells 
transformed with the vectors for the replication and expression of the cDNA 
of the invention, including the nucleotide sequence shown in SEQ ID NO. 2, 
3, 5, 7, 8, 10, 12, 13 or 15 or the open reading frame thereof. The cells 
will be chosen to be compatible with the vector and may for example be bacterial , 
yeast, insect or mammalian. 

A further embodiment of the present invention provides a method of 
producing a polypeptide which comprises culturing host cells of the present 
invention under conditions effective to express a polypeptide of the 
invention. Preferably, in addition, such a method is carried out under 
conditions in which the polypeptide of the invention is expressed and then 
produced from the host cells. 

cDNA of the present invention may also be inserted into the vectors 
described above in an antisense orientation in order to proved for the 
production of antisense RNA. Such antisense RNA may be used in a method of 
controlling the levels of a polypeptide of the invention in a cell. 

The invention also provides monoclonal or polyclonal antibodies 
against a polypeptide of the invention. The invention further provides a 
process for the production of monoclonal or polyclonal antibodies to the 
polypeptides of the invention. Monoclonal antibodies may be prepared by 
common hybridoma technology using polypeptides of the invention or fragments 
thereof, as an immunogen . Polyclonal antibodies may also be prepared by 
common means which comprise inoculating host animals, for example a rat or 
a rabbit, with polypeptides of the invention and recovering immune serum. 

The present invention also provides pharmaceutical compositions 
containing a polypeptide of the invention, or an antibody thereof, in 





association with a pharmaceutically acceptable diluent and/or carrier. 

The polypeptide of the present invention includes that which a part 
of their amino acid sequence is lacking (e.g., a polypeptide comprised of 
the only essential sequence for revealing a biological activity in an amino 
acid sequence shown in SEQ ID N0.1), that which a part of their amino acid 
sequence is replaced by other amino acids (e.g., those replaced by an amino 
acid having a similar property) and that which other amino acids are added 
or inserted into a part of their amino acid sequence, as well as those having 
the amino acid sequence shown in SEQ ID NO. 1, 4, 6, 9, 11 or 14. 

As known well, there are one to six kinds of codon as that encoding 
one amino acid (for example, one kind of codon for Methioine (Met), and six 
kinds of codon for leucine (Leu) are known). Accordingly, the nucleotide 
sequence of cDNA can be changed in order to encode the polypeptide having 
the same amino acid sequence. 

The DNA of the present invention, specified in (2) includes a group 
of every nucleotide sequences encoding polypeptides (1) shown in SEQ ID NO. 
1, 4, 6, 9, 11 or 14. There is a probability that yield of a polypeptide 
is improved by changing a nucleotide sequence. 

The cDNA specified in (3) is the embodiment of the cDNA shown in (2), 
and indicate the sequence of natural form. 

The cDNA shown in (4) indicates the sequence of the cDNA specified 
in (3) with natural non- trans la t iona 1 region. 

cDNA carrying nucleotide sequence shown in SEQ ID NO. 3, 8 or 13 is 
prepared by the following method: 

Brief description of Yeast SST method (see USP No. 5 , 536 , 637) is as 
f ol lows . 

Yeast such as Saccharomyces cerevisiae should secrete invertase intc 
the medium in order to take sucrose or raffinose as a source of energy or 
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carbon . 

(Invertase is an enzyme to cleave raffinose into sucrose and melibiose, 
sucrose into fructose and glucose.) 

It is known that many known mammalian signal peptide make yeast secrete its 
invertase . 

From these knowledge, SST method was developed as a screening method to find 
novel signal peptide which make it possible can to secrete yeast invertase 
from mammalian cDNA library. SST method uses yeast growth on raffinose 
medium as a marker. Non-secretory type invertase gene SUC2 ( GENBANK 
Accession No. V 01311) lacking initiation codon ATG was inserted to yeast 
expression vector to prepare yeast SST vector pSUC2 . 

In this expression vector, ADH promoter, ADH terminator (both were derived 
from AAH5 plasmid (Gammerer, Methods in Enzymol . 101, 192-201, 1983)), 2u 
ori (as a yeast replication origin), TRP 1 (as a yeast selective marker), 
ColEl ori(as a E. Coli replication origin) and ampicillin resistance gene 
(as a drug resistance marker) were inserted. 

Mammalian cDNA was inserted into the upstream of SUC2 gene to prepare yeast 
SST cDNA library. Yeast lacking secretory type invertase, was transformed 
with this library. 

If inserted mammalian cDNA encodes a signal peptide, yeast could be survive 
in raffinose medium as a result of restoring secretion of invertase. 
Only to culture yeast colonies, prepare plasmids and determine the nucleotide 
sequence of the insert cDNAs, it is possible to identify novel signal peptide 
rapidly and easily. 

Preparation of yeast SST cDNA library is as follows: 
(1) mRNA is isolated from the targeted cells, second- strand synthesis is 
performed by using random primer with certain restriction enzyme (enzyme 
I) recognition site, 
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( 2 ) double- strand cDNA is ligated to adapter containing certain restriction 
endonuclease (enzyme II) recognition site, differ from enzyme I, digested 
with enzyme I and fractionated in a appropriate size, 

(3) obtained cDNA fragment is inserted into yeast expression vector on the 
upstream region of invertase gene which signal peptide is deleted and the 
library was transformed. 

Detailed description of each step is as follows: 
(1) mRNA is isolated from mammalian organs and cell lines stimulate them 
with appropriate stimulator if necessary ) by known methods (Molecular Cloning 
(Sambrook, J . , Fritsch, E. F. and Maniatis, T. , Cold Spring Harbor Laboratory 
Press, 1989) or Current Protocol in Molecular Biology (F. M. Ausubel et al, 
John Wiley & Sons, Inc.) if not remark especially). 

Mouse embryonic heart is chosen as a tissue source. Double-strand 
cDNA synthesis using random primer is performed by known methods. 

Any sites may be used as restriction endonuclease recognition site 

I which is linked to adapter and restriction endonuclease recognition site 

II which is used in step (2), if both sites are different each other. 
Preferably, Xhol is used as enzyme I and EcoRI as enzyme II. 

In step ( 2 ) , cDNA is created blunt -ends with T4 DNA polymerase , ligated 
enzyme II adapter and digested with enzyme I. Fragment cDNA is analyzed with 
agarose-gel electrophoresis and is selected cDNA fraction ranging in size 
from 300 to 800 bp. As mentioned above, any enzyme may be used as enzyme 
II if it is not same the enzyme I. 

In step (3), cDNA fragment obtained in step (2) is inserted into yeast 
expression vector on the upstream region of invertase gene which signal 
peptide is deleted. E . coli transformed with the expression vector. Many 
vectors are known as yeast expression plasmid vector. For example, YEp24 
is also functioned in E . Coli. Preferably pSUC2 as described above is used. 



Many host E. Coli strains are known for transformation, preferably 
DH103 competent cell is used. Any known transformation method is available, 
preferably it is performed by elec tropola tion method. Transformant is 
cultured by known methods to obtain cDNA library for yeast SST method. 

However not every All of the clones do not contain cDNA fragment. 
Further all of the gene fragments do not encode unknown signal peptides. 
It is therefore necessary to screen a gene fragment encoding for an unknown 
signal peptide from the library. 

Therefore, screening of fragments containing a sequence encoding an 
appropriate signal peptide is performed by transformation of the cDNA library 
into Saccharomyces cerevisiae (e.g. YT455 strain) which lack invertase (it 
may be prepared by known methods . ) . 

Transformation of yeast is performed by known methods, e.g. lithium acetate 
method. Transformant is cultured in a selective medium, then transferred 
to a medium containing raffinose as a carbon source. Survival colonies are 
selected and then prepared plasmid. Survival colonies on a r a f f inose-medium 
indicates that some signal peptide of secretory protein was inserted to this 
c lone . 

Isolated positive clones is determined the nucleotide sequence. As 
to a cDNA encodes unknown protein, full-length clone may be isolated by using 
cDNA fragment as a probe and then determined to obtain full-length nucleotide 
sequence. These manipulation is performed by known methods. 

Once the nucleotide sequences shown in SEQ ID NO. 2, 5, 7, 10, 12 or 
15 are determined partially or preferably fully, it is possible to obtain 
cDNA encode mammalian protein itself, homologue or subset of the invention. 
cDNA library or mRNA derived from mammals was screened by PCR with any 
synthesized oligonucleotide primers or by hybridization with any fragment 
as a probe. It is possible to obtain cDNA encodes other mammalian homologue 
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protein from other mammalian cDNA or genome library. 

If a cDNA obtained above contains a nucleotide sequence of cDNA fragment 
obtained by SST (or concensus sequence thereof), it will be thought that 
the cDNA encodes signal peptide. So it is clear that the cDNA will be 
full-length or almost full. 

(All signal peptides exist at N-termini of a protein and are encoded at 
S'-temini of open reading frame of cDNA.) 

The confirmation may be carried out by Northern analysis with the 
said cDNA as a probe. It is thought that the cDNA is almost complete length, 
if length of the cDNA is almost the same length of the mRNA obtained in the 
hybridizing band. 

The present invention supplies full-length protein and also its mature 
protein sequence. The full-length protein sequence deduced from nucleotide 
sequences shown in SEQ ID NO. 2, 7 or 12. 

Mature proteins are obtained by expressing full-length cDNAs shown in SEQ 
ID NO. 3, 8 or 13 in mammalian cells or other host cells. 

Mature protein sequences are deduced from their full-length amino acid 
sequences . 

Once the nucleotide sequences shown in SEQ ID NOs . 2, 5, 7, 10, 12 
or 15 are determined, cDNAs of the present invention are obtained by chemical 
synthesis, or by hybridization making use of nucleotide fragments which are 
chemically synthesized as a probe. 

Furthermore, cDNAs of the present invention are obtained in desired amount 
by transforming a vector that contains the cDNA into a proper host, and 
culturing the t r ans f orman t . 

The polypeptides of the present invention may be prepared by: 

(1) isolating and purifying from an organism or a cultured cell, 

(2) chemically synthesizing, or 



(3) using recombinant DNA technology, 

preferably, by the method described in (3) in industrial production. 

Examples of expression system for ( hos t-vec toer system) producing a 
polypeptide by using recombinant DNA technology are the expression systems 
of bacteria, yeast, insect cells and mammalian cells. 

In the expression of the polypeptide, for example, in E. Coli, the 
expression vector is prepared by adding the initiation codon ( ATG ) to 5' 
end of a DNA encoding mature peptide, connecting the DNA thus obtained to 
the downstream of a proper promoter (e.g., trp promoter, lac promoter, >. 
PL promoter, T7 promoter etc.), and then inserting it into a vector (e.g., 
pBR322, pUC18, pUC19 etc.) which functions in an E. coli strain. 

Then, an E. coli strain (e.g. , E. coli DH1 strain, E. coli JM109 strain, 
E. coli HB101 strain, etc. ) which is transformed with the expression vector 
described above may be cultured in a appropriate medium to obtain the desired 
polypeptide. When a signal peptide of bacteria (e.g. , signal peptide of pel 
B) is utilized, the desired polypeptide may be also released in periplasm. 
Furthermore, a fusion protein with other polypeptide may be also produced 
easily . 

In the expression of the polypeptide , for example, in a mammalian eel 1 s , 
for example, the expression vector is prepared by inserting the DNA encoding 
cDNA shown in SEQ ID NO. 3, 8 or 13 into the downstream of a proper promoter 
(e.g., SV40 promoter, LTR promoter, me t al lot hionein promoter etc.) in a 
proper vector (e.g., retrovirus vector, papilloma virus vector, vaccinia 
virus vector, SV40 vector, etc. ) a proper mammalian cell (e.g. , monkey COS-7 
cell, Chinese hamster CHO cell, mouse L cell etc.) is transformed with the 
expression vector thus obtained, and then the trans formant is cultured in 
a proper medium to get a desired polypeptide in the culture medium. Further, 
fusion protein may be produced by linking cDNA fragment encoding other 





polypeptide such as Fc portion of an antibody. The polypeptide thus obtained 
may be isolated and purified by conventional biochemical methods. 

Effect of the invention 

The polypeptides of the present invention and cDNA encoding them are 
expected to exhibit one or more of the uses or biological activities ( including 
those associated with assays cited herein) identified below. 
Uses or activities described for proteins of the present invention may be 
provided by administration or use of such proteins or by administration or 
use of cDNA encoding them {such as, for example, in gene therapies or vectors 
suitable for introduction of DNA ) . 

We have been confirmed that the said polypeptide possess the suppressing 
activity on the differentiation of vascular smooth muscle cells. 
Accordingly, the polypeptides may be useful for treatment of diseases related 
to abnormal proliferation of smooth muscle cells, for example, 
arteriosclerotic coronary disease, neointimal formation which results in 
restenosis after percutaneous transluminal coronary angioplasty and 
myosarcoma . 

But not limit the present invention: 

<Cytokine activity and cell proliferation/differentiation activity> 

The protein of the present invention may exhibit cytokine, cell 
proliferation (either inducing or inhibiting) or cell differentiation 
(either inducing or inhibiting) activity or may induce production of other 
cytokines in certain cell populations. 

Many protein factors discovered to date, including all known cytokines, have 
exhibited activity in one or more factor dependent cell proliferation assays, 
and hence the assays serve as a convenient confirmation of cytokine activity. 
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The activity of a protein of the present invention is evidenced by any one 
of a number of routine factor dependent cell proliferation assays for cell 
lines . 

< Immune stimulating/ suppressing activity> 

The protein of the present invention may also exhibit immune 
stimulating or immune suppressing activity. The protein may be useful in 
the treatment of various immune deficiencies and disorders (including severe 
combined immunodeficiency (SCID)), e.g., in regulating (up or down) growth 
and proliferation of T and/ or B lymphocytes , as well as effecting the cytolytic 
activity of NK cells and other cell populations. 

These immune deficiencies may be genetic or be caused by viral (e.g. HIV) 
as well as bacterial or fungal infections, or may result from autoimmune 
disorders . 

More specifically, infectious diseases causes by viral, bacterial, fungal 
or other infection may be treatable using the protein of the present invention, 
including infections by HIV, hepatitis viruses , herpes viruses , mycobacteria , 
leshmania, malaria and various fungal infections such as Candida. 
Of course, in this regard, a protein of the present invention may also be 
useful where a boost to the immune system generally would be indicated, i.e. , 
in the treatment of cancer. 

Such a protein of the present invention may also to be useful in the 
treatment of allergic reactions and conditions, such as asthma or other 
respiratory problems . 

The protein of the present invention may also suppress chronic or acute 
inflammation, such as, for example, that associated with infection (such 
as septic shock or systemic inflammatory response syndrome (SIRS)), 
inflammatory bowel disease, Crohn ' s disease or resulting from over production 
of cytokines such as TNF or IL-I (such as the effect demonstrated by II- 
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11 ) . 

< Hematopoiesis regulating activity> 

The protein of the present invention may be useful in regulation of 
hematopoiesis and, consequently, in the treatment of myeloid or lymphoid 
cell deficiencies. 

Even marginal biological activity in support of colony forming cells or of 
factor-dependent cell lines indicates involvement in regulating 
hematopoiesis . 

The said biological activities are concerned with the following all 
or some example(s). 

e.g. in supporting the growth and proliferation of erythroid progenitor cells 
alone or in combination with other cytokines, thereby indicating utility, 
for example, in treating various anemias or for use in conjunction with 
irradiation/chemotherapy to stimulate the production of erythroid 
precursors and/or erythroid cells; 

in supporting the growth and proliferation of myeloid cells such as 
granulocytes and monocytes /macrophages (i.e., traditional CSF activity) 
useful, for example, in conjunction with chemotherapy to prevent or treat 
consequent mye lo- suppres sion ; 

in supporting the growth and proliferation of megakaryocytes and consequently 
of platelets thereby allowing prevention or treatment of various platelet 
disorders such as thrombocytopenia, and generally for use in place of or 
complimentary to platelet transfusions; 

and/or in supporting the growth and proliferation of hematopoietic stem cells 
which are capable of maturing to any and all of the above-mentioned 
hematopoietic cells and therefore find therapeutic utility in various stem 
cell disorders (such as those usually treated with transplantation, including, 
without limitation, aplastic anemia and paroxysmal nocturnal 
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hemoglobinuria), as well as in repopulating the stem cell compartment post 
irradiation/chemotherapy, either in-vivo or ex-vivo (i.e. in conjunction 
with bone marrow transplantation) as normal cells or genetically manipulated 
for gene therapy. 

Suitable assays for proliferation and differentiation of various 
hematopoietic lines are cited above. 

The activity of a protein of the invention may, among other means, 
be measured by the following methods : 
<T issue generation/ re generation activity> 

The protein of the present invention also may have utility in 
compositions used for bone, cartilage, tendon, Ligament and/or nerve tissue 
growth or regeneration, as well as for wound healing and tissue repair, and 
in the treatment of bums, incisions and ulcers. 

The protein of the present invention, which induces cartilage and/or bone 
growth in circumstances where bone is not normally formed, has application 
in the healing of bone fractures and cartilage damage or defects in humans 
and other animals. 

Such a preparation employing the protein of the invention may have 
prophylactic use in closed as well as open fracture reduction and also in 
the improved fixation of artificial joints. 

De novo bone formation induced by an osteogenic agent contributes to the 
repair of congenital, trauma induced, or oncologic resection induced 
craniofacial defects, and also is useful in cosmetic plastic surgery. 

The protein of this invention may also be used in the treatment of 
periodontal disease, and in other tooth repair processes. Such agents may 
provide an environment to attract bone-forming cells, stimulate growth of 
bone-forming cells or induce differentiation of progenitors of bone-forming 
cells . 
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The protein of the invention may also be useful in the treatment of 
osteoporosis or osteoarthritis, such as through stimulation of bone and/or 
cartilage repair or by blocking inflammation or processes of tissue 
destruction (collagenase activity, osteoclast activity, etc.) mediated by 
inflammatory processes. 

Another category of tissue regeneration activity that may be 
attributable to the protein of the present invention is tendon / 1 igament 
formation. A protein of the present invention, which induces 
tendon/ligament-like tissue or other tissue formation in circumstances where 
such tissue is not normally formed, has application in the healing of tendon 
or ligament tears, deformities and other tendon or ligament defects in humans 
and other animals . 

Such a preparation employing a tendon /Ligament- 1 ike tissue inducing protein 
may have prophylactic use in preventing damage to tendon or ligament tissue, 
as well as use in the improved fixation of tendon or ligament to bone or 
other tissues, and in repairing defects to tendon or ligament tissue. 
De novo tendon/ 1 igament- like tissue formation induced by a composition of 
the present invention contributes to the repair of congenital , trauma induced, 
or other tendon or ligament defects of other origin, and is also useful in 
cosmetic plastic surgery for attachment or repair of tendons or ligaments. 
The compositions of the present invention may provide an environment to 
attract tendon- or 1 igament - forming cells, stimulate growth of tendon- or 
ligament-forming cells, induce differentiation of progenitors of tendon- 
or ligament- forming cells, or induce growth of tendon Ligament cells or 
progenitors ex vivo for return in vivo to effect tissue repair. 
The compositions of the invention may also be useful in the treatment of 
tendinitis, carpal tunnel syndrome and other tendon or ligament defects. 
The compositions may also include an appropriate matrix and/cr sequestering 
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agent as a carrier as is well known in the art. 

The protein of the present invention may also be useful for 
proliferation of neural cells and for regeneration of nerve and brain tissue, 
i.e. for the treatment of central and peripheral nervous system diseases 
and neuropathies, as well as mechanical and traumatic disorders, which 
involve degeneration, death or trauma to neural cells or nerve tissue. 
More specifically, a protein may be used in the treatment of diseases of 
the peripheral nervous system, such as peripheral nerve injuries, peripheral 
neuropathy and localized neuropathies, and central nervous system diseases, 
such as Alzheimer's, Parkinson's disease, Huntington's disease, amyotrophic 
lateral sclerosis, and Shy-Drager syndrome. 

Further conditions which may be treated in accordance with the present 
invention include mechanical and traumatic disorders, such as spinal cord 
disorders, head trauma and cerebrovascular diseases such as stroke. 
Peripheral neuropathies resulting from chemotherapy or other medical 
therapies may also be treatable using a protein of the invention. 

It is expected that the protein of the present invention may also 

exhibit activity for generation of other tissues, such as organs (including, 
for example, pancreas, liver, intestine, kidney, skin, endothelium), muscle 
(smooth, skeletal or cardiac) and vascular (including vascular endothelium) 

tissue, or for promoting or supressing the proliferation of cells comprising 

such tissues. Part of the desired effects may be by inhibition of fibrotic 

scarring to allow normal tissue to regenerate. 

A protein of the present invention may also be useful for gut protection 

or regeneration and treatment of lung or liver fibrosis, reperfusion injury 

in various tissues, and conditions resulting from systemic cytokine damage. 

<Act i vin / Inhibin activity> 

The protein of the present invention may also exhibit activin- or 
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inhibin-related activities. Inhibins are characterized by their ability to 
inhibit the release of follicle stimulating hormone ( FS H ) , while activins 
and are characterized by their ability to stimulate the release of follicle 
stimulating hormone { FSH ) . 

Thus, a protein of the present invention, alone or in heterodimers with a 
member of the inhibin *a family, may be useful as a contraceptive based on 
the ability of inhibins to decrease fertility in female mammals and decrease 
spermatogenesis in male mammals. Administration of sufficient amounts of 
other inhibins can induce infertility in these mammals. 

Alternatively, the protein of the invention, as a homodimer or as a heterodimer 
with other protein subunits of the inhibin-*b group, may be useful as a 
fertility inducing therapeutic, based upon the ability of activin molecules 
in stimulating FSH release from cells of the anterior pituitary. 
See for example, USP 4 , 798 , 885. The polypeptide of the invention may also 
be useful for advancement of the onset of fertility in sexually immature 
mammals, so as to increase the lifetime reproductive performance of domestic 
animals such as cows, sheep and pigs. 
<Chemot ac t ic / c nemo ki net ic ac t ivi ty> 

A protein of the present invention may have chemotactic or chemokinetic 
activity (e.g., act as a chemokine ) for mammalian cells, including, for 
example, monocytes, neutrophils, T-cells, mast cells, eosinophils and/or 
endothelial cells. 

Chemotactic and chemokinetic proteins can be used to mobilized or attract 
a desired cell 'population to a desired site of action. Chemotactic or 
chemokinetic proteins provide particular advantages in treatment of wounds 
and other trauma to tissues, as well as in treatment of localized infections. 
For example, attraction of lymphocytes, monocytes or neutrophils to tumors 
or sites of infection may result in improved immune responses against the 
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tumor or infecting agent . 

A protein or peptide has chemotactic activity for a particular cell 
population if it can stimulate, directly or indirectly, the directed 
orientation or movement of such cell population. Preferably, the protein 
or peptide has the ability to directly stimulate directed movement of cells. 
Whether a particular protein has chemotactic activity for a population of 
cells can be readily determined by employing such protein or peptide in any 
known assay for cell chemotaxis . 
<Hemostatic and thrombolytic activity> 

The protein of the invention may also exhibit hemostatic or thrombolyic 
activity. As a result, such a protein is expected to be useful in treatment 
of various coagulation disorders (including hereditary disorders, such as 
hemophilias) or to enhance coagulation and other hemostatic events in 
treating wounds resulting from trauma, surgery or other causes. A protein 
of the invention may also be useful for dissolving or inhibiting formation 
of thromboses and for treatment and prevention of conditions resulting 
therefrom (such as. for example, infarction or stroke). 
<Receptor /ligand activity> 

The protein of the present invention may also demonstrate activity 
as receptors, receptor ligands or inhibitors or agonists of receptor / 1 igand 
interactions. Examples of such receptors and ligands include, without 
limitation, cytokine receptors and their ligands, receptor kinases and their 
ligands, receptor phosphatases and their ligands, receptors involved in 
cell-cell interactions and their ligands (including without limitation, 
cellular adhesion molecules (such as selectins, integrins and their ligands) 
and receptor / ligand pairs involved in antigen presentation, antigen 
recognition and development of cellular and humoral immune responses). 
Receptors and ligands are also useful for screening of potential peptide 
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or small molecule inhibitors of the relevant receptor / 1 i gand interaction. 

A protein of the present invention (including, without limitation, fragments 

of receptors and ligands ) may themselves be useful as inhibitors of 

receptor/ ligand interactions. 

<Nutritional uses> 

Proteins of the present invention can also be sued as nutritional 

sources or supplements . Such uses include without limitation use as a protein 
or amino acid supplement, use as a carbon source, use as a nitrogen source 
and use as a source of carbohydrate. In such cases the protein of the present 
invention can be added to the feed of a particular organism or can be 
administered as a separate solid or liquid preparation, such as in the form 
of powder, pills, solutions, suspensions or capsules. In the case of 
microorganisms, the protein of the invention can be added to the medium in 
or on which the microorganism is cultured. 
<Cadherin/Tumor invasion suppresser activity> 

Cadherins are calcium-dependent adhesion molecules that appear to play 
ma j or roles during development , particularly in defining specific cell types . 
Loss or alteration of normal cadherin expression can lead to changes in cell 
adhesion properties linked to tumor growth and metastasis. 

Cadherin malfunction is also implicated in other human diseases, such as 
pemphigus vulgaris and pemphigus foliaceus (autoimmune blistering skin 
diseases), Crohn's disease, and some developmental abnormalities. 

The cadherin superfamily includes well over forty members, each with 
a distinct pattern of expression. All members of the superfamily have in 
common conserved extracellular repeats (cadherin domains), but structural 
differences are found in other parts of the molecule. 

The cadherin domains bind calcium to form their tertiary structure and thus 
calcium is required to mediate their adhesion. Only a few amino acids in 
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the first cadherin domain provide the basis for homophilic adhesion; 
modification of this recognition site can change the specificity of a cadherin 
so that instead of recognizing only itself, the mutant molecule can now also 
bind to a different cadherin. In addition, some cadherins engage in 
heterophilic adhesion with other cadherin. 

E-cadherin, one member of the cadherin superfamily, is expressed in 
epithelial cell types. Pathologically, if E-cadherin expression is lost in 
a tumor, the malignant cells become invasive and the cancer metastasizes. 
Transfection of cancer cell line with cDNAs expressing E-cadherin has 
reversed cancer- as socia ted changes by returning altered cell shapes to normal , 
restoring cells adhesiveness to each other and to their substrate, decreasing 
the cell growth rate, and drastically reducing anchorage-independent cell 
growth . 

Thus, reintroducing E-cadherin expression reverts carcinomas to a less 
advanced stage. It is likely that other cadherins have the same invasion 
suppresser role in carcinomas derived from other tissue types. Therefore, 
proteins of the present invention with cadherin activity, and cDNAs of the 
present invention encoding such proteins, can be used to treat cancer. 
Introducing such proteins or cDNAs into cancer cells can reduce or eliminate 
the cancerous change observed in these cells by providing normal cadherin 
expression . 

Cancer cells have also been shown to express cadherins of a different 
tissue type than their origin, thus allowing these cells to invade and 
metastasize in a different tissue in the body. Proteins of the present 
invention with cadherin activity, and cDNAs of the present invention encoding 
such proteins, can be substituted in these cells for the inappropriately 
expressed cadherins, restoring normal cell adhesive properties and reducing 
or eliminating the tendency of the cells to metastasize. 
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Additionally, proteins of the present invention with cadherin activity, 
and cDNA of the present invention encoding such proteins, can used to generate 
antibodies recognizing and binding to cadherins . 

Such antibodies can be used to block the adhesion of inappropriately expressed 
tumor-cell cadherins, preventing the cells from forming a tumor elsewhere. 
Such an ant i-cadherin antibody can also be used as a marker for the grade, 
pathological type, and prognosis of a cancer, i.e. the more progressed the 
cancer, the less cadherin expression there will be, and this decrease in 
cadherin expression can be detected by the use of a cadherin-binding antibody. 

Fragments of proteins of the present invention with cadherin activity, 
preferably a polypeptide comprising a decapeptide of the cadherin recognition 
site, and cDNAs of the present invention encoding such protein fragments, 
can also be used to block cadherin function by binding to cadherins and 
preventing them from binding in ways that produce undesirable effects. 
Additionally, fragments of proteins of the present invention with cadherin 
activity, preferably truncated soluble cadherin fragments which have been 
found to be stable in the circulation of cancer patients, and polynulceot ides 
encoding such protein fragments, can be used to disturb proper cell-cell 
adhesion . 

<Tumor Inhibiting activity> 

In addition to the activities described above for immunological 
treatment or prevention of tumors, the protein of the invention may exhibit 
other anti-tumor activities. The protein may inhibit tumor growth directly 
or indirectly (such as, for example, via ADCC ) . 

The protein may exhibit its tumor inhibitory activity by acting on tumor 
tissue or tumor precursor tissue, by inhibiting formation of tissues 
necessary to support tumor growth (such as, for example, by inhibiting 
angioger.esis ) , by causing production of other factors, agents or cell types 
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which inhibit tumor growth, or by suppressing, eliminating or inhibiting 
factors, agents or cell types which promote tumor growth. 
<Other activity> 

The protein of the invention may also exhibit one or more of the 
following additional activities or effects: inhibiting the growth, infection 
or function of, or killing, infectious agents, including, bacteria, viruses, 
fungi and other parasites; 

effecting (suppressing or enhancing) bodily characteristics, including, 
height, weight, hair color, eye color, skin, fat to lean ratio or other tissue 
pigmentation, or organ or body part size or shape (such as, for example, 
breast augmentation or diminution); 

effecting elimination of dietary fat, protein, carbohydrate; 
effecting behavioral characteristics, including appetite, libido, stress, 
cognition (including cognitive disorders), depression and violent 
behaviors ; 

providing analgesic effects or other pain reducing effects; 

promoting differentiation and growth of embryonic stem cells in lineages 

other than hematopoietic lineages ; 

in the case of enzymes, correcting deficiencies of the enzyme and treating 
deficiency- related diseases. 

The polypeptide with above activities, is suspected to have following 
functions by itself or interaction with its ligands or receptors or 
association with other molecules. For example, proliferation or cell death 
of B cells, T cells and/or mast cells or class specific induction of B cells 
by promotion of class switch of immunoglobulin genes; differentiation of 
B cells to antibody- forming cells; proliferation, differentiation, or cell 
death of precursors of granulocytes; proliferation, differentiation, or cell 
death of precursors of monocytes-macrophages ; 
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proliferation, of up regulation or cell death of neutrophils, 
monocytes-macrophages, eosinophils and/or basophils; proliferation, or cell 
death of precursors of megakaryocytes; proliferation, differentiation, or 
cell death of precursors of neutrophils; proliferation, differentiation, 
or cell death of precursors of T cells and B cells; promotion of production 
of erythrocytes; sustainment of proliferation of erythrocytes, neutrophils, 
eosinophils, basophils, monocytes-macrophages, mast cells, precursors of 
megakaryocyte ; promotion of migration of neutrophils, monocytes- 
macrophages, B cells and/or T cells; proliferation or cell death of 
thymocytes; suppression of differentiation of adipocytes; proliferation or 
cell death of natural killer cells; 

proliferation or cell death of hematopoietic stem cells; suppression of 
proliferation of stem cells and each hematopoietic precursor cells ; promotion 
of differentiation from mesenchymal stem cells to osteoblasts or chondrocytes , 
proliferation or cell death of mesenchymal stem cells, osteoblasts or 
chondrocytes and promotion of bone absorption by activation of osteoclasts 
and promotion of differentiation from monocytes to osteoclasts. 

This peptide is also suspected to function to nervous system, so 
expected to have functions below; differentiation to kinds of 
neurotransmitter-responsive neurons, survival or cell death of these cells; 
promotion of proliferation or cell death of glial cells; spread of neural 
dendrites; survival or cell death of gangr iocy tes ; proliferation, promotion 
of differentiation, or cell death of astrocytes; proliferation or survival 
of peripheral neurons; proliferation or cell death of Schwann cells; 
proliferation, survival or cell death of motoneurons. 

Furthermore, in the process of development of early embryonic, this 
polypeptide is expected to promote or inhibit the organogenesis of epidermis, 
brain, backbone, and nervous system by induction of ectoderm, that of 
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notochord connective tissues ( bone, muscle, tendon), hemocytes, heart, 
kidney, and genital organs by induction of mesoderm, and that of digestive 
apparatus (stomach, intestine, liver, pancreas), respiratory apparatus 
(lung, trachea) by induction of endoderm. In adult, also, this polypeptide 
is thought to proliferate or inhibit the above organs. 

Therefore, this polypeptide itself is expected to be used as an agent 
for the prevention or treatment of disease of progression or suppression 
of immune, nervous, or bone metabolic function, hypoplasia or overgrowth 
of hematopoietic cells: inflammatory disease (rheumatism, ulcerative 
colitis, etc.), decrease of hematopoietic stem cells after bone marrow 
transplantation, decrease of leukocytes, platelets, B-cells, or T-cells 
after radiation exposure or chemot her apeut ic dosage against cancer or 
leukemia, anemia, in f ec t ious di sease , cancer, leukemia, AIDS, bone metabolic 
disease ( osteoporosis etc.), arteriosclerosis, various degenerative disease 
(Alzheimer's disease, multiple sclerosis, etc.), or nervous lesion. 

In addition, since this polypeptide is thought to induce the 
differentiation or growth of organs derived from ectoderm, mesoderm, and 
endoderm, this polypeptide is expected to be an agent for tissue repair 
(epidermis, bone, muscle, tendon, heart, kidney, stomach, intestine, liver, 
pancreas, lung, and trachea, etc.). 

Quantitation of this polypeptide in the body can be performed using 
polyclonal or monoclonal antibodies against this polypeptide. It can be used 
the study of relationship between this polypeptide and disease or diagnosis 
of disease, and so on. Polyclonal and monoclonal antibodies can be prepared 
using this polypeptide or its fragment as an antigen by known method. 

Identification, purification or molecular cloning of known or unknown 
proteins which bind this polypeptide can be performed using this polypeptide 
by, for example, preparation of the affinity-column. 

3 1 




Identification of the molecules which interact with this polypeptide 
and molecular cloning of the gene can be performed by west-western method 
using this polypeptide or by yeast two-hybrid system using the cDNA 
(preferably cDNA encoding this polypeptide). 

Agonists/antagonists of this receptor polypeptide and inhibitors 
between receptor and signal transduction molecules can be screened using 
this polypeptide. 

For example, the screening can be carried out the following method. 

a) The reaction mixtures, which contain this polypeptide , screening compound 
and the cells, are incubated under the condition which the cells are normally 
stimulated by this peptide. (The reaction mixtures also contain the labeled 
compound, which is introduced into the cells according to the cell 
proliferation, and peptide which allow to observe the function of this peptide 
efficiently . ) 

b) Decision that the compounds are efficient agonis t s / ant agoni s t s or not, 
are performed by measurement of cell proliferation ability. 

More detailed methods are followed: 

Rat vascular muscle cell line (ATCC CRL-1444 or CRL1476) is cultured 
in 96 well plate with 10%FBS for 24 hours . Then the culture medium are replaced 
to the serum-free medium supplemented with each several concentrations of 
human PDGF-BB . At that time compounds to screen as well as A55 protein are 
added in the medium when screening the antagonists of A55 protein. While, 
compounds alone are added in the medium when screening the agonists of A55 
protein. After 24 hours incubation, these cells are pulsed for 4hours with 
3H- thymidine . By measuring the 3H-thymidine incorporation, it is possible 
to determine whether the compounds have inhibitory or stimulatory effect 
on the A55 activity. 

cDNAs of the present invention are useful not only the important and 
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essential template for the production of the polypeptide of the present 
invention which is expected to be largely useful, but also be useful for 
diagnosis or therapy (for example, treatment of gene lacking, treatment to 
stop the expression of the polypeptide by antisense DNA ( RNA ) ) . 
Genomic DNA may be isolated with the cDNA of the present invention, as a 
probe. As the same manner, a human gene encoding which can be highly 
homologous to the cDNA of the present invention, that is, which encodes a 
polypeptide highly homologous to the polypeptide of the present invention 
and a gene of animals excluding mouse which can be highly homologous to the 
cDNA of the present invention, also may be isolated. 

Application for Pharmaceuticals 

For the medical treatment for diseases described above, the 
polypeptide of the invention or the antibody of the polypeptide of the 
invention may be administered systemically or partially in most cases, 
usually by oral or parenteral administration, preferably orally, 
intravenously or intraventricular ly . 

The doses to be administered depend upon age, body weight, symptom, 
desired therapeutic effect, route of administration, and duration of the 
treatment etc. In human adults, one dose per person is generally between 
100 ug and 100 mg , by oral administration, up to several times per day, and 

between 10 ug and 100 mg , by parenteral administration up to several times 
per day. 

As mentioned above, the doses to be used depend upon various conditions . 
Therefore, there are cases in which doses lower than or greater than the 
ranges specified above may be used. 

The compounds of the present invention, may be administered as solid 
compositions, liquid compositions or other compositions for oral 
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administration, as injections, liniments or suppositories etc. for 
parenteral administration . 

Solid compositions for oral administration include compressed tablets, 
pills, capsules, dispersible powders, granules. Capsules include soft or 
hard capsules . 

In such compositions, one or more of the active compound(s) is or are 
admixed with at least one inert diluent (such as lactose, mannitol, glucose, 
hydroxypropyl cellulose, microcrys talline cellulose, starch, 
polyvinylpyrrolidone, magnesium metasilicate aluminate, etc.). The 
compositions may also comprise, as is normal practice, additional substances 
other than inert diluents: e.g. lubricating agents (such as magnesium 
stearate etc. ) , disintegrating agents (such as cellulose calcium glycolate, 
etc.), stabilizing agents (such as human serum albumin, lactose etc.), and 
assisting agents for dissolving (such as arginine, asparaginic acid etc .) . 

The tablets or pills may, if desired, be coated with a film of gastric 
or enteric materials (such as sugar, gelatin, hydroxypropyl cellulose or 
hydroxypropylmethyl cellulose phthalate, etc . ) , or be coated with more than 
two films. And then, coating may include containment within capsules of 
absorbable materials such as gelatin. 

Liquid compositions for oral administration include 
pharmaceutically-acceptable emulsions, solutions, syrups and elixirs. In 
such compositions, one or more of the active compound(s) is or are contained 
in inert diluent(s) commonly used (purified water, ethanol etc.). 3esides 
inert diluents, such compositions may also comprise adjuvants (such as 
wetting agents, suspending agents, etc.), sweetening agents, flavoring 
agents, perfuming agents, and preserving agents. 

Other compositions for oral administration include spray compositions 
which may be prepared by known methods and which comorise one or -ore of 
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the active compound ( s ) . Spray compositions may comprise additional 
substances other than inert diluents: e.g. stabilizing agents (sodium 
sulfite etc.), isotonic buffer (sodium chloride, sodium citrate, citric 
acid, etc.). For preparation of such spray compositions, for example, the 
method described in the United States Patent No. 2,868,691 or 3,095,355 
(herein incorporated in their entireties by reference) may be used. 

Injections for parenteral administration include sterile aqueous or 
non-aqueous solutions, suspensions and emulsions. In such compositions, one 
or more active compound(s) is or are admixed with at least one inert aqueous 
diluent(s) (distilled water for injection, physiological salt solution, 
etc.) or inert non-aqueous diluents ( s )( propylene glycol, polyethylene 
glycol, olive oil, ethanol , POLYSOLBATE 80 TM , etc.). 

Injections may comprise additional compound other than inert diluents : 
e.g. preserving agents, wetting agents, emulsifying agents, dispersing 
agents, stabilizing agent ( such as human serum albumin , lactose, etc.), and 
assisting agents such as assisting agents for dissolving (arginine, 
asparaginic acid, etc.). 

The following examples concerning clone A55 are illustrated, but not 
limit the present invention. 

Example 1 

Preparation of poly(A)+RNA 

Total RNA was prepared from mouse day!8.5 embryonic heart by TRIzol 
reagent (Trade Mark, GIBCOBRL) , and poly (A) + RNA was purified from the total 
RNA by mRNA Purification Kit (Trade Mark, Pharmacia). 




Example 2 

Preparation of yeast SST cDNA library 

Double strand cDNA was synthesized by Superscript Plasmid System for 
cDNA Synthesis and Plasmid Cloning (brand name, GIBCOBRL) with above 
poly(A)+RNA as template and random 9mer as primer which was containing Xhol 
site : 

5'-CGA TTG AAT TCT AGA CCT GCC TCG AGN NNN NNN NN- 3 ' 

cDNA was ligated EcoRI adapter by DNA ligation kit ver.2 (trade name, Takara 
Shuzo; this kit was used in all ligating steps hereafter.) and digested by 
Xhol. cDNAs were separated by agarose-gel electrophoresis. 300 - 800 bp 
cDNAs were isolated and were ligated to EcoRI/NotI site of pSUC2 (see US 
5 , 536 , 637). E. Coli DH 1 OB strain were transformed by pSUC2 with 
electropolation to obtain yeast SST cDNA library. 

Example 3 

Screening by SST method and DNA sequencing of positive clone 

Plasmids of the cDNA library were prepared. Yeast YTK12 strain were 
transformed by the plasmids with lithium acetate method (Current Protocols 
In Molecular Biology 13.7.1). The transformed yeast were plated on 
triptphan-f ree medium (CMD-Try medium) for selection. The plate was 
incubated for 48 hour at 30 oC . Replica of the colony which is obtained by 
Accutran Replica Plater (trade name, Schleicher & Schuell) were place YPR 
plate containing raffinose for carbon source, and the plate was incubated 
for 14 days at 30 oC . 

After 3 days , each colony appeared was streaked on YPR plate again. The plates 
were incubated for 48 hours at 30 oC . Single colony was inoculated to YPR 
medium and was incubated for 48 hours at 30 oC . Then plasmids were prepared. 
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Insert cDNA was amplified by PCR with two kind primers which exist end side 
of cloning site on pSUC2 (sense strand primers were biotiny lated ) . 
Biotinylated single strand of cDNAs were purified with Dynabeads (trade name, 
DYNAL) and determined the nucleotide sequences. 

Sequencing was performed by Dye Terminator Cycle Sequencing Ready Reaction 
with DNA Sequencing kit (trade name, Applied Biosystems Inc.) and sequence 
was determined by DNA sequencer 373 (Applied Biosystems Inc.). All 
sequencing hereafter was carried with this method. 

The clone named A55 is not registered on databases by homology search 
of cDNA sequence and deduced amino acid sequence and so it is cleared that 
the sequence is novel one. So, we tried to isolate clone full-length cDNA 
of the fragment of A55 clone (hereafter A55 SST fragment cDNA) . We confirmed 
that A55 SST fragment cDNA contains signal peptide by comparison with known 
peptide which has signal peptide in view of function and structure. 
Example 4 

Cloning and sequencing of a full-length cDNA of A55 

Phage particles of a cDNA library of mouse dayl3 embryonic 
heart ( uni-ZAP XR, Stratagene) were transfected to E. coli XLl-Blue MRF* host 
cells ( Stratagene ). Obtained one million plaques were transferred to nylon 
membranes. The membranes were hybridized with 32P-labeled mouse A55 SST 
fragment cDNA as a probe. Many positive plaques were obtained. 
From one positive plaque the phage particles containing a cloned insert were 
prepared, and were subjected to conversion into phagemid particles 
(pBluescript SK(-)) by co-infection of E . coli XLl-Blue MRF * host 
cells ( Stratagene ) with ExAssist helper phage ( Stratagene ) . The phagemid 
particles were transfected to E . coli DH5 a . The plasmids were prepared "from 
the obtained t r ans f ormant s . 

Nucleotide sequence of 5 ' - end of c DNA were determined to confirm the existence 

3 7 



of the sequences of SST fragment cDNA- And then full-length sequencing were 
performed to obtain SEQ ID NO . 3 . 

An open reading frame was determined and translation region for amino acid 
sequence shown in SEQ ID NO. 2 and deduced full-length amino acid sequence 
shown in SEQ ID NO. 1 were obtained. Mature protein of the said polypeptide 
was deduced to 425 amino acids shown in SEQ ID NO. 3 (144. .1418) or 423 amino 
acids shown in SEQ ID NO. 4. Translation region of SEQ ID NO. 4 is shown 
in SEQ ID NO . 5 . 

It was confirmed that there was no identical sequences to the DNA of 
the present invention by homology search program, BLASTN and FASTA against 
public nucleotide database. And it was also confirmed that there were no 
identical sequences to the polypeptide of the present invention (mouse A55 
protein) by homologue search program, BLASTX, BLASTP and FASTA against amino 
acid database . 

It is revealed that the polypeptide of the present invention, mouse A55 is 
novel secretion protein since the polypeptide have no trans-membrane region 
by hydrophobisity analysis of the amino acid sequence. 



It was revealed that A55 protein contained six EGF like domains by 



amino acid sequence of clone mouse A55 ( 1-448 AA region) and the one of human 
Sl-5 (SwissProt Accession No. HSU03877) (1-387 AA region) by the comparison 
using BLASTX , BLASTP and FASTA. It was reported that human Sl-5 was a secreted 
protein containing EGF like domain, was induced in fibroblasts by growth 
arrest, and stimulated DNA synthesis (Beata Lecka-Czernik et . al . Mol . Cell. 
Biol. 15, 120-128, 1995). Farther it was revealed that A55 protein was 
homologous to many proteins containing EGF-like domain. 



motif search, so it was expected that clone A55 also possesses EGF family 




Significant homology were also recognized between the 
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Example 5 

Isolation of isoform gene of mouse A55 protein 

Initiation coden was determined by cloning of 5' -end cDNA by 5 ' -RACE 

(Rapid Amplification of cDNA Ends method using Marathon cDNA Amplification 
Kit (trade name, Clontech) . Double stranded cDNA template was prepared from 
poly(A)+RNA of mouse embryonic heart tissue. Primer mA55-Rl : 

5'- CGT TTG TGC ACT GCT GCT GTG CAT TCC -3' 

was prepared based on the information of full-length nucleotide sequences. 
PCR was performed with the said primer and adapter primer attached in the 
kit . 

Amplified cDNA was separated with agarose-gel electrophoresis, and to pGEM-T 
Vector (trade name, Promega), ligated in and transformed to E. Coli DH5a 
and then plasmid was prepared. The full-length nucleotide sequences were 
determined. We found two deferent 5 ' -end sequences. One was identical to 
the clone containing the sequence in SEQ ID NO. 3, the other contained unknown 
sequence and no trans 1 ational start site ATG (See SEQ ID NO. 7 and 8). 
The region defined from exon 1 of the clone was replaced by another exon 
which exists 4 00 bp downstream region of exon 1 was clarified by gene analysis . 
So it was cleared that the clone shown in SEQ ID NO. 8 was generated by 
alternative splicing of exon 1 . 

The clone encodes isoform protein shown in SEQ ID NO. 6 (6 amino acids in 
N termini of SEQ ID NO.l was replaced by 19 amino acids in N termini of SEQ 
ID NO. 6 ) . 

The mature protein of this polypeptide was deduced 425 amino acids shown 
in SEQ ID NO. 8 (340.. .1614) or 423 amino acids shown in SEQ ID NO. 9. SEQ 
ID NO. 10 is the trans 1 ational region of the polypeptide shown in SEQ ID 
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NO . 9 . 

Example 6 

Determination of nucleotide sequence of human A55 gene 

The present inventors found that Human EST sequence ( GENBANK Accession 

No. H17726) homologous to 5 * -end sequence of mouse A55 in the process of 

homology search shown in example 4 . 

And the present inventors buy the Clone ID 50483 derived from human 

brain cDNA library GENBANK Accession No. H17726 from American Type Culture 

Collection (ATCC). The full-length nucleotide sequence shown in SEQ ID NO. 

13 was determined with the same manner as in the determination of mouse A55. 
Open reading frame was determined and trans lational region shown in SEQ ID 
NO. 12 and deduced amino acid sequence shown in SEQ ID NO. 11 were obtained. 
From above results, it is clarified that the human clone is full-length and 
have 89.3 % homology to mouse A55 at DNA level ( trans 1 at ional region) and 
have 94.2 % homology to the one at amino acid level. It is suggested that 
the obtained human clone should be human counterpart of mouse A55. 

(The clone was called human A5 5 hereafter.) 
The mature protein of this polypeptide was deduced 425 amino acids shown 
in SEQ ID NO. 13 (238... 1512) or 423 amino acids shown in SEQ ID NO. 14. 
Translational region of the polypeptide shown in SEQ ID NO. 14 shows in SEQ 
ID NO. 15. 

It was confirmed that there was no identical sequences to the DNAs 
of the present invention by homology search program, BLASTN and FASTA against 
public nucleotide database. And it was also confirmed that there were no 
identical sequences to human A55 proteins by homologue search program, 3LASTX, 
BLAST? and FASTA against amino acid database. 
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Example 7 

Mouse A55 protein expression in mammalian cell 

Mouse full-length cDNA shown in SEQ ID NO. 3 was inserted into 
expression vector for mammalian cell pNotS (Kaufman et al . , Nucleic Acids 
Res. 19, 4485-4490 (1991)) and mouse A55 expression plasmid P NotS-mA55 was 
constructed . 

293T cells (which is derived from 293 cells (ATCC CRL-1573) and it 
stably transfected with SV40 T antigen) were transfected with pNotS and 
pNotS-mA55 using lipofection (GIBCOBRL). After preincubated for 19 hours, 
the cells were pulsed for 30 minutes with 35 S-Met in the Met-free medium. 
Then the cells were incubated in the medium containing Met for 5 hours. 
Supernatant of the cells was recovered and concentrated 10-fold using 
centricon-10 (trade name, AMICON). Samples were subjected to SDS- 
polyacrylamide-gel electrophoresis. The gel was dried and 35 S-labeled 
proteins were detected with BAS 2000 (Fuji Film). 

A band was detected at 60-70 kDa in the supernatant of pNot S-mA5 5 - trans f ec ted 
293T cells. This band was not detected in the supernatant of pNotS- 
transfected 293T cells. This results confirmed that recombinant mouse A55 
protein was expressed and secreted into the medium. 

Molecular weight (60-70 kDa) of this recombinant mouse A5 5 protein was greater 
than it (48 kDa) predicted from its amino acid sequences. As this protein 
had two potential N-linked glycosy lation sites and many Ser and Thr residues 
in which 0-linked glycosyl chain could be added , it was suggested that the 
mouse A55 protein was a glycoprotein. 

Example 8 

Measurement of inhibition on proliferation of rat vascular smooth muscle 
cells by mouse A55 protein 



Vascular smooth muscle cells were isolated from rat aorta ranging from 
heart to diaphragm and cultured primarily by the methods described in Shin 
Seikagaku Jikken Kouza 10 (The Japanese Biochemical Society). 
These ceils were co-incubated with 1, 3 or 10 ng/ml of human recombinant 
PDGF-BB (Genzyme, and 10 » (v/v) of the mock or mA55 supernatant prepared 
according to the method described in example 7. And BrdU incorporation was 
measured using a Cell Proliferation ELISA , BrdU colorimetric kit 
(Boehringer-Mannheim) . 

The supernatant from 293T cells transfected with P NotS-mA55 significantly 
inhibited BrdU incorporation of rat primary vascular smooth muscle cells, 
while the supernatant from 293T cells transfected with only pNo tS show no 

effect as shown in Fig. l. 

Moreover the supernatant from 293T cell, transfected with p N otS-mA55 also 
inhibited BrdU incorporation even when rat vascular smooth muscle cells were 
stimulated with 1, 3 or 10 ng/ml of PDGF and increased BrdU incorporation 
in a dose-dependent manner, whereas the supernatant from 293T cells 
transfected with only pNo tS did not affect compared with no supernatant 
addition (See Fig. l). 

These data revealed that the recombinant mouse A55 protein had the growth 
inhibitory activity on vascular smooth muscle cells. 

Example 9 

Human A55 protein expression in mammalian cell 

Human A55 expression plasmid, P NotS-hA55 , was constructed by inserting 
human full-length cDNA shown in SEQ ID N0 . 13 into was into expression vector 
for mammalian cell pNotS (Kaufman et al . , Nucleic Acids Res. 19, 4485-4490 
( 1991 ) ) . 

Cosl cells were transfected with pNotS and pNotS-hASS by lipofect.r. 
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( GIBCOBRL ) . After preincubation for 24 hours, the cells were pulsed for 5 
hours with 35S-Met and 35S-Cys. Supernatant of the cells was recovered and 
concentrated to 10-fold using centricon-10 (trade name, AMICON). Proteins 
in concentrated supernatant were separated by electrophoresis through 
SDS-page. The gel was dried and 35S-labeled proteins were detected with BAS 
2000 (Fuji Film) . 

A band was detected at 60-70 kDa in the supernatant from Cosl cells trans feet ed 
with pNotS-hA55. This band was not detected in the supernatant of pNotS 
transfected Cosl cells. These results confirmed that recombinant human A55 
protein was expressed and secreted into the medium. And human A55 protein 
was also suggested that sugar chains were also added to human A55 protein 
as well as mouse A55. 

Example 10 

Detection of the inhibitory activity on proliferation of rat vascular smooth 
muscle cells by human A55 protein 

A DNA fragment encoding a signal sequence of honey bee mertin, a tag 
sequence of six His residues and an enterokinase cleavage site was added 
to 5'-end of human A55 cDNA sequence from 238 to 1515 in SEQ ID NO. 13 or 
sequence in SEQ ID NO. 15 followed by stop codon and inserted into expression 
vector pNotS. Cosl cells were transfected with pNotS-hA55 plasmid DNA and 
pNotS control plasmid DNA. The supernatant was recovered , digested by 
enterokinase, pulled through nickel column to remove the linker peptide, 
and then concentrated 10-fold by centricon-10 (trade name, AMI CON ) . 

After rat vascular muscle cell line (ATCC CRL-1444) was cultured in 
96 well plates with 10% FBS for 24 hours, these cells were incubated for 
24 hours with serum-free medium supplemented with several concentrations 
(1, 10 or 10 ng/ml) of human PDGF-BB (Genzyme) and with 10 % total volume 
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of supernatant of Cosl cells which were transfected with pNotS-hA55 or pNotS , 
and then were pulsed for 4 hours with 3H- thymidine . After harvesting 
3H-thymidine incorporation was detected. In this cell line remarkable 
decrease of 3H-thymidine incorporation were observed by supplement with hA55 
supernatant, while there was no effect in the presence of the control 
supernatant . 

Moreover similar effects were also observed when using other rat vascular 
smooth muscle cell lines (ATCC CRL-1476 and CRL-2018) and human vascular 
smooth muscle cell line (ATCC CRL-1999). These results revealed that the 
recombinant human A55 protein also had the growth inhibitory activity on 
the vascular smooth muscle cells as well as mouse A55 protein. 

Morphological change was observed on the vascular smooth muscle cells 
treated with the supernatant from hA55-transf ected Cosl cells by microscopy 
detection. While no morphological change was observed on melanoma cell line 
SK-MEL-28 at the same experiment. Furthermore, hA55 protein was observed 
to induce the expression of chemokine JE and JK . 

Experiment 11 

Preparation of anti mouse A55 polyclonal antibody 

Three kinds of peptide fragments of mouse A55 were synthesized by solid 
phase method: 

RTNPVYRGP YSNP YSTS YSG ( 7 1-90) ( 48-67 of SEQ ID NO. 1) 

GAYY I FQI KSGNEGREF YMR (376-395) (353-372 of SEQ ID NO. 1) 

MTRP IKGPRDIQLDLEMITVN (406-426) (383-403 of SEQ ID NO. 1). 

Rabbits were immunized to these peptides as immunogen and the serum 
were prepared after measurement of the activity. Each anti-mouse A55 



antibody was purified by affinity column immobilized each peptide which was 
used as immunogen from the obtained serum. 

The supernatant prepared by the same method described in example 7, was 

subjected to SDS-PAGE , the separated proteins were transferred to Immobilon-P 

( PVDF membrane, trade name, Millopore) from the acrylamide gel. 

After blocking the membranes they were incubated with the anti mouse A55 

polyclonal antibody as the first antibody and by developing using ECL kit 

(Amersham), the recombinant mouse A55 protein was detected. 

A 6 0 k Da band was detected in the supernatant from mA55 transf ected Cosl 

cells as well as 35S-labeling experiment described in example 7. While no 

bands were detected in the supernatant from mock- trans f ected Cosl cells. 

These results confirmed that the obtained polyclonal antibodies specifically 

recognized the mouse A55 protein. 

Brief Description of Figures 

Fig. 1 It shows that mouse A55 protein inhibits proliferation of rat 

aortic vascular smooth muscle cells which was stimulated by PDGF . 

Fig. 2 It shows that human A55 protein inhibits proliferation of rat 

aortic vascular smooth muscle cells which was stimulated by PDGF. 



Sequence Listing 

SEQ ID NO. : 1 
Length : 448 amino acids 
Type : amino acid 
Topology : liner 
Molecule type : protein 
Sequence Description : 

Met Pro Gly Leu Lys Arg lie Leu Thr Val Thr lie Leu Ala Leu Trp 
-23 -20 -15 -10 

Leu Pro His Pro Gly Asn Ala Gin Gin Gin Cys Thr Asn Gly Phe Asp 
-5 1 5 

Leu Asp Arg Gin Ser Gly Gin Cys Leu Asp lie Asp Glu Cys Arg Thr 
10 15 20 25 

lie Pro Glu Ala Cys Arg Gly Asp Met Met Cys Val Asn Gin Asn Gly 
30 35 40 

Gly Tyr Leu Cys He Pro Arg Thr Asn Pro Val Tyr Arg Gly Pro Tyr 
45 50 55 

Ser Asn Pro Tyr Ser Thr Ser Tyr Ser Gly Pro Tyr Pro Ala Ala Ala 
60 65 70 

Pro Pro Val Pro Ala Ser Asn Tyr Pro Thr He Ser Arg Pro Leu Val 
75 80 85 
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Cys Arg Phe Gly Tyr Gin Met Asp Glu Gly Asn Gin Cys Val Asp Val 
90 95 100 105 

Asp Glu Cys Ala Thr Asp Ser His Gin Cys Asn Pro Thr Gin lie Cys 
110 115 120 

lie Asn Thr Glu Gly Gly Tyr Thr Cys Ser Cys Thr Asp Gly Tyr Trp 
125 130 135 

Leu Leu Glu Gly Gin Cys Leu Asp lie Asp Glu Cys Arg Tyr Gly Tyr 
140 145 150 

Cys Gin Gin Leu Cys Ala Asn Val Pro Gly Ser Tyr Ser Cys Thr Cys 
155 160 165 

Asn Pro Gly Phe Thr Leu Asn Asp Asp Gly Arg Ser Cys Gin Asp Val 
170 175 180 185 

Asn Glu Cys Glu Thr Glu Asn Pro Cys Val Gin Thr Cys Val Asn Thr 
190 195 200 

Tyr Gly Ser Phe lie Cys Arg Cys Asp Pro Gly Tyr Glu Leu Glu Glu 
205 2 10 2 15 



Asp Gly lie His Cys Ser Asp Met Asp Glu Cys Ser Phe Ser Glu Phe 
220 225 230 



Leu Cys Gin His Glu Cys Val Asn Gin Pro Gly Ser Tyr Phe Cys Ser 
235 240 24 5 

Cys Pro Pro Gly Tyr Val Leu Leu Asp Asp Asn Arg Ser Cys Gin Asp 
250 255 260 265 

lie Asn Glu Cys Glu His Arg Asn His Thr Cys Thr Ser Leu Gin Thr 
270 275 280 

Cys Tyr Asn Leu Gin Gly Gly Phe Lys Cys lie Asp Pro lie Ser Cys 
285 290 295 

Glu Glu Pro Tyr Leu Leu lie Gly Glu Asn Arg Cys Met Cys Pro Ala 
300 305 310 

Glu His Thr Ser Cys Arg Asp Gin Pro Phe Thr lie Leu Tyr Arg Asp 
315 320 325 

Met Asp Val Val Ser Gly Arg Ser Val Pro Ala Asp lie Phe Gin Met 
330 335 340 345 

Gin Ala Thr Thr Arg Tyr Pro Gly Ala Tyr Tyr He Phe Gin He Lys 
350 355 360 

Ser Gly Asn Glu Gly Arg Glu Phe Tyr Met Arg Gin Thr Gly Pro He 
365 370 375 

Ser Ala Thr Leu Val Met Thr Arg Pro He Lys Gly Pro Arg Asp He 
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380 



385 



390 



Gin Leu Asp Leu Glu Met lie Thr Val Asn Thr Val lie Asn Phe Arg 



395 



400 



405 



Gly Ser Ser Val He Arg Leu Arg He Tyr Val Ser Gin Tyr Pro Phe 
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415 



420 



425 



SEQ ID NO. : 2 

Length : 1344 base pairs 
Type : nucleic acid 
Strandness : single 
Topology : liner 
Molecule type : cDNA to mRNA 
Sequence Description 

ATGCCAGGAT TAAAAAGGAT ACTCACTGTT ACCATCTTGG CACTCTGGCT TCCACATCCT 60 
GGGAATGCAC AGCAGCAGTG CACAAACGGC TTTGACCTGG ACCGCCAGTC AGGACAGTGT 12 0 
CTAGATATTG ATGAATGCCG GACCATCCCT GAGGCTTGTC GTGGGGACAT GATGTGTGTC 180 
AACC AGAATG GCGGGTATTT GTGCATCCCT CGAACCAACC CAGTGTATCG AGGGCCTTAC 24 0 
TCAAATCCCT ACTCTACATC CTACTCAGGC CCATACCCAG CAGCGGCCCC ACCAGTACCA 3 00 



GCTTCCAACT ACCCCACGAT TTCAAGGCCT CTTGTCTGCC GCTTTGGGTA 7CAGATGGAT 3 60 



GAAGGCAACC AGTGTGTGGA TGTGGACGAG TGTGCAACAG ACTCACACCA GTGCAACCCT 42 0 

ACCCAGATCT GTATCAACAC TGAAGGAGGT TACACCTGCT CCTGCACCGA TGGGTACTGG 4 80 

CTTCTGGAAG GGCAGTGCCT AGATATTGAT GAATGTCGCT ATGGTTACTG CCAGCAGCTC 54 0 

TGTGCAAATG TTCCAGGATC CTATTCCTGT ACATGCAACC CTGGTTTCAC CCTCAACGAC 600 

GATGGAAGGT CTTGCCAAGA TGTGAACGAG TGCGAAACTG AGAATCCCTG TGTTCAGACC 6 60 

TGTGTCAACA CCTATGGCTC TTTCATCTGC CGCTGTGACC CAGGATATGA ACTTGAGGAA 720 

GATGGCATTC ACTGCAGTGA TATGGACGAG TGCAGCTTCT CCGAGTTCCT CTGTCAACAC 7 80 

GAGTGTGTGA ACCAGCCGGG CTCATACTTC TGCTCGTGCC CTCCAGGCTA CGTCCTGTTG 84 0 

GATGATAACC GAAGCTGCCA GGATATCAAT GAATGTGAGC ACCGAAACCA CACGTGTACC 9 00 

TCACTGCAGA CTTGCTACAA TCTACAAGGG GGCTTCAAAT GTATTGATCC CATCAGCTGT 9 60 

GAGGAGCCTT ATCTGCTGAT TGGTGAAAAC CGCTGTATGT GTCCTGCTGA GCACACCAGC 1020 

TGCAGAGACC AGCCATTCAC CATCCTGTAT CGGGACATGG ATGTGGTGTC AGGACGCTCC 1080 

GTTCCTGCTG ACATCTTCC A GATGCAAGC A ACAACCCGAT ACCCTGGTGC CTATTAC ATT 1140 

TTCCAGATC A AATCTGGCAA CGAGGGTCGA GAGTTCTATA TGCGGCAAAC AGGGCCTATC 12 00 
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AGTGCCACCC TGGTGATGAC ACGCCCCATC AAAGGGCCTC GGGACATCCA GCTGGACTTG 12 60 



GAGATGATCA CTGTCAACAC TGTCATCAAC TTCAGAGGCA GCTCCGTGAT CCGACTGCGG 132 0 



ATATATGTGT CGCAGTATCC GTTC 



SEQ ID NO. : 3 

Length : 2233 base pairs 
Type : nucleic acid 
Strandness : single 
Topology : liner 
Molecule type : cDNA to mRNA 
Original source 
Organism : Mus Musculus 

Organelle : dayl3 mouse embryonic heart 

Clone Name : mouse A55 
Feature 

Name/Key : CDS 

Location : 75 14 1 8 

Identification method : S 

Name/Key : sig peptide 

Location : 7 5 . . 1 4 3 

Identification method : S 

Name/Key : mat peptide 

Location : 1 4 4 1 4 1 8 

Identification method : S 
Sequence Description: 



AATTCGGCAC GAGCCCCAGT CCCACCGCAG AGCCTGCCTT CCTCGCGTCG CTTCTCCTCC 60 

CGCGCATCTT GGAT ATG CCA GGA TTA AAA AGG ATA CTC ACT GTT ACC ATC 110 
Met Pro Gly Leu Lys Arg lie Leu Thr Val Thr lie 
-23 -20 -15 

TTG GCA CTC TGG CTT CCA CAT CCT GGG AAT GCA CAG CAG CAG TGC ACA 158 
Leu Ala Leu Trp Leu Pro His Pro Gly Asn Ala Gin Gin Gin Cys Thr 
-10 -5 1 5 

AAC GGC TTT GAC CTG GAC CGC CAG TCA GGA CAG TGT CTA GAT ATT GAT 2 06 

Asn Gly Phe Asp Leu Asp Arg Gin Ser Gly Gin Cys Leu Asp lie Asp 
10 15 20 

GAA TGC CGG ACC ATC CCT GAG GCT TGT CGT GGG GAC ATG ATG TGT GTC 254 
Glu Cys Arg Thr lie Pro Glu Ala Cys Arg Gly Asp Met Met Cys Val 
25 30 35 

AAC CAG AAT GGC GGG TAT TTG TGC ATC CCT CGA ACC AAC CCA GTG TAT 3 02 

Asn Gin Asn Gly Gly Tyr Leu Cys lie Pro Arg Thr Asn Pro Val Tyr 
40 45 50 

CGA GGG CCT TAC TCA AAT CCC TAC TCT ACA TCC TAC TCA GGC CCA TAC 350 
Arg Gly Pro Tyr Ser Asn Pro Tyr Ser Thr Ser Tyr Ser Gly Pro Tyr 
55 60 65 

CCA GCA GCG GCC CCA CCA GTA CCA GCT TCC AAC TAC CCC ACG ATT TCA 3 98 

Pro Ala Ala Ala Pro Pro Val Pro Ala Ser Asn Tyr Pro Thr lie Ser 
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70 75 80 85 

AGG CCT CTT GTC TGC CGC TTT GGG TAT CAG ATG GAT GAA GGC AAC CAG 44 6 

Arg Pro Leu Val Cys Arg Phe Gly Tyr Gin Met Asp Glu Gly Asn Gin 
90 95 100 

TGT GTG GAT GTG GAC GAG TGT GCA ACA GAC TCA CAC CAG TGC AAC CCT 4 94 

Cys Val Asp Val Asp Glu Cys Ala Thr Asp Ser His Gin Cys Asn Pro 
105 110 115 

ACC CAG ATC TGT ATC AAC ACT GAA GGA GGT TAC ACC TGC TCC TGC ACC 54 2 

Thr Gin lie Cys lie Asn Thr Glu Gly Gly Tyr Thr Cys Ser Cys Thr 
120 125 130 

GAT GGG TAC TGG CTT CTG GAA GGG CAG TGC CTA GAT ATT GAT GAA TGT 590 
Asp Gly Tyr Trp Leu Leu Glu Gly Gin Cys Leu Asp lie Asp Glu Cys 
135 140 145 

CGC TAT GGT TAC TGC CAG CAG CTC TGT GCA AAT GTT CCA GGA TCC TAT 638 

Arg Tyr Gly Tyr Cys Gin Gin Leu Cys Ala Asn Val Pro Gly Ser Tyr 
150 155 160 165 

TCC TGT ACA TGC AAC CCT GGT TTC ACC CTC AAC GAC GAT GGA AGG TCT 686 

Ser Cys Thr Cys Asn Pro Gly Phe Thr Leu Asn Asp Asp Gly Arg Ser 
170 175 180 

TGC CAA GAT GTG AAC GAG TGC GAA ACT GAG AAT CCC TGT GTT CAG ACC 7 34 

Cys Gin Asp Val Asn Glu Cys Glu Thr Glu Asn Pro Cys Val Gin Thr 
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185 190 195 

TGT GTC AAC ACC TAT GGC TCT TTC ATC TGC CGC TGT GAC CCA GGA TAT 7 82 

Cys Val Asn Thr Tyr Gly Ser Phe lie Cys Arg Cys Asp Pro Gly Tyr 
200 205 210 



GAA CTT GAG GAA GAT GGC ATT CAC TGC AGT GAT ATG GAC GAG TGC AGC 83 0 

Glu Leu Glu Glu Asp Gly lie His Cys Ser Asp Met Asp Glu Cys Ser 
215 220 225 



TTC TCC GAG TTC CTC TGT CAA CAC GAG TGT GTG AAC CAG CCG GGC TCA 87 8 

Phe Ser Glu Phe Leu Cys Gin His Glu Cys Val Asn Gin Pro Gly Ser 
230 235 240 245 



TAC TTC TGC TCG TGC CCT CCA GGC TAC GTC CTG TTG GAT GAT AAC CGA 92 6 

Tyr Phe Cys Ser Cys Pro Pro Gly Tyr Val Leu Leu Asp Asp Asn Arg 
250 255 260 



AGC TGC CAG GAT ATC AAT GAA TGT GAG CAC CGA AAC CAC ACG TGT ACC 9 74 

Ser Cys Gin Asp lie Asn Glu Cys Glu His Arg Asn His Thr Cys Thr 
265 270 275 



TCA CTG CAG ACT TGC TAC AAT CTA CAA GGG GGC TTC AAA TGT ATT GAT 102 2 
Ser Leu Gin Thr Cys Tyr Asn Leu Gin Gly Gly Phe Lys Cys lie Asp 
280 285 290 



CCC ATC AGC TGT GAG GAG CCT TAT CTG CTG ATT GGT GAA AAC CGC TGT 1070 
Pro lie Ser Cys Glu Glu Pro Tyr Leu Leu lie Gly Glu Asn Arg Cys 
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295 300 305 

ATG TGT CCT GCT GAG CAC ACC AGC TGC AGA GAC CAG CCA TTC ACC ATC 1118 

Met Cys Pro Ala Glu His Thr Ser Cys Arg Asp Gin Pro Phe Thr lie 

310 315 320 325 

CTG TAT CGG GAC ATG GAT GTG GTG TCA GGA CGC TCC GTT CCT GCT GAC 1166 

Leu Tyr Arg Asp Met Asp Val Val Ser Gly Arg Ser Val Pro Ala Asp 

330 335 340 

ATC TTC CAG ATG CAA GCA ACA ACC CGA TAC CCT GGT GCC TAT TAC ATT 1214 
lie Phe Gin Met Gin Ala Thr Thr Arg Tyr Pro Gly Ala Tyr Tyr lie 
345 350 355 

TTC CAG ATC AAA TCT GGC AAC GAG GGT CGA GAG TTC TAT ATG CGG CAA 12 62 
Phe Gin lie Lys Ser Gly Asn Glu Gly Arg Glu Phe Tyr Met Arg Gin 
360 365 370 

ACA GGG CCT ATC AGT GCC ACC CTG GTG ATG ACA CGC CCC ATC AAA GGG 13 10 
Thr Gly Pro He Ser Ala Thr Leu Val Met Thr Arg Pro He Lys Gly 
375 380 385 

CCT CGG GAC ATC CAG CTG GAC TTG GAG ATG ATC ACT GTC AAC ACT GTC 135 8 

Pro Arg Asp He Gin Leu Asp Leu Glu Met He Thr Val Asn Thr Val 
390 395 4 00 405 

ATC AAC TTC AGA GGC AGC TCC GTG ATC CGA CTG CGG ATA TAT GTG TCG 14 06 

He Asn Phe Arg Gly Ser Ser Val He Arg Leu Arg He Tyr Val Ser 




410 



415 



420 



CAG TAT CCG TTC TGAGCCTCTG GCTAAGGCCT CTGACACTGC CTTTCACCAG 



1458 



Gin Tyr Pro Phe 
425 

CACCGAGGGA CGGGAGGAGA AAGGAAACCA GCAAGAATGA GAGCGAGACA GACATTGCAC 15 18 

CTTTCCTGCT GAATATCTCC TGGGGGCATC AGCCTAGCAT CTTGACCCAT ATCTGTACTA 15 7 8 

TTGCAGATGG TCACTCTGAA GGACACCCTG CCCTCAGTTC CTATGATGCA GTTATCCAAA 1638 

AGTGTTCATC TTAGCCCCTG ATATGAGGTT GCCAGTGACT CTTCAAAGCC TTCCATTTAT 169 8 

TTCCATCGTT TTATAAAAAA GAAAATAGAT TAGATTTGCT GGGGTATGAG TCCTCGAAGG 175 8 

TTCAAAAGAC TGAGTGGCTT GCTCTCACCT CTTCCTCTCC TTCCTCCATC TCTTGCTGCA 1818 

TTGCTGCTTT GCAAAAGTCC TCATGGGCTC GTGGGAAATG CTGGGAATAG CTAGTTTGCT 1878 

TCTTGCATGT TCTGAGAAGG CTATGGGAAC ACACCACAGC AGGATCGAAG GTTTTTATAG 1938 

AGTCTATTTT AAAATCACAT CTGGTATTTT CAGCATAAAA GAAATTTTAG TTGTCTTTAA 19 9 8 

AATTTGTATG AGTGTTTAAC CTTTTCTTAT TCATTTTGAG GCTTCTTAAA GTGGTAGAAT 2058 

TCCTTCCAAA GGCCTCAGAT AC ATGTTATG TTC AGTCTTT CCAACCTCAT CCTTTCCTGC 2118 
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ATCTTAGCCC AGTTTTTACG AAGACCCCTT AATCATGCTT TNTTAAGAGT TTTTACCCAA 2 178 
CTGCGTTGGA AGACAGAGGT ATCCAGACTG ATTAAATAAT TGAAGAAAAA AAAAA 2233 

SEQ ID NO. : 4 

Length : 423 amino acids 
Type : amino acid 
Topology : liner 
Molecule type : protein 
Sequence Description : 

Gin Cys Thr Asn Gly Phe Asp Leu Asp Arg Gin Ser Gly Gin Cys Leu 
15 10 15 

Asp lie Asp Glu Cys Arg Thr lie Pro Glu Ala Cys Arg Gly Asp Met 
20 25 30 

Met Cys Val Asn Gin Asn Gly Gly Tyr Leu Cys lie Pro Arg Thr Asn 
35 40 45 

Pro Val Tyr Arg Gly Pro Tyr Ser Asn Pro Tyr Ser Thr Ser Tyr Ser 
50 55 60 

Gly Pro Tyr Pro Ala Ala Ala Pro Pro Val Pro Ala Ser Asn Tyr Pro 
65 7 0 75 80 

Thr He Ser Arg Pro Leu Val Cys Arg Phe Gly Tyr Gin Met Asp Glu 
85 90 95 
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Gly Asn Gin Cys Val Asp Val Asp Glu Cys Ala Thr Asp Ser His Gin 
100 105 110 

Cys Asn Pro Thr Gin He Cys He Asn Thr Glu Gly Gly Tyr Thr Cys 
115 120 125 

Ser Cys Thr Asp Gly Tyr Trp Leu Leu Glu Gly Gin Cys Leu Asp He 
130 135 140 

Asp Glu Cys Arg Tyr Gly Tyr Cys Gin Gin Leu Cys Ala Asn Val Pro 
145 150 155 160 

Gly Ser Tyr Ser Cys Thr Cys Asn Pro Gly Phe Thr Leu Asn Asp Asp 
165 170 175 

Gly Arg Ser Cys Gin Asp Val Asn Glu Cys Glu Thr Glu Asn Pro Cys 
180 185 190 

Val Gin Thr Cys Val Asn Thr Tyr Gly Ser Phe He Cys Arg Cys Asp 
195 200 205 

Pro Gly Tyr Glu Leu Glu Glu Asp Gly He His Cys Ser Asp Met Asp 
210 215 220 

Glu Cys Ser Phe Ser Glu Phe Leu Cys Gin His Glu Cys Val Asn Gin 
225 230 235 240 



Pro Gly Ser Tyr Phe Cys Ser Cys Pro Pro Gly Tyr Val Leu Leu Asp 
245 250 255 

Asp Asn Arg Ser Cys Gin Asp lie Asn Glu Cys Glu His Arg Asn His 
260 265 270 

Thr Cys Thr Ser Leu Gin Thr Cys Tyr Asn Leu Gin Gly Gly Phe Lys 
275 280 285 

Cys lie Asp Pro lie Ser Cys Glu Glu Pro Tyr Leu Leu lie Gly Glu 
290 295 300 

Asn Arg Cys Met Cys Pro Ala Glu His Thr Ser Cys Arg Asp Gin Pro 
305 310 315 320 

Phe Thr lie Leu Tyr Arg Asp Met Asp Val Val Ser Gly Arg Ser Val 
325 330 335 

Pro Ala Asp lie Phe Gin Met Gin Ala Thr Thr Arg Tyr Pro Gly Ala 
340 345 350 

Tyr Tyr lie Phe Gin lie Lys Ser Gly Asn Glu Gly Arg Glu Phe Tyr 
355 360 365 

Met Arg Gin Thr Gly Pro lie Ser Ala Thr Leu Val Met Thr Arg Pro 
370 375 380 

lie Lys Gly Pro Arg Asp lie Gin Leu Asp Leu Glu Met lie Thr Val 
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385 



390 



395 



400 



Asn Thr Val He Asn Phe Arg Gly Ser Ser Val He Arg Leu Arg He 



405 



410 



415 



Tyr Val Ser Gin Tyr Pro Phe 
420 

SEQ ID NO. : 5 

Length : 1269 base pairs 
Type : nucleic acid 
Strandness : single 
Topology : liner 
Molecule type : cDNA to mRNA 
Sequence Description : 

CAGTGCACAA ACGGCTTTGA CCTGGACCGC CAGTCAGGAC AGTGTCTAGA TATTGATGAA 6 0 
TGCCGGACCA TCCCTGAGGC TTGTCGTGGG GACATGATGT GTGTCAACCA GAATGGCGGG 120 
TATTTGTGCA TCCCTCGAAC CAACCCAGTG TATCGAGGGC CTTACTCAAA TCCCTACTCT 180 
ACATCCTACT CAGGCCCATA CCCAGCAGCG GCCCCACCAG TACCAGCTTC CAACTACCCC 240 
ACGATTTCAA GGCCTCTTGT CTGCCGCTTT GGGTATCAGA TGGATGAAGG CAACCAGTGT 3 00 
GTGGATGTGG ACGAGTGTGC AACAGACTCA CACCAGTGCA ACCCTACCCA GATCTGTATC 3 60 
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AACACTGAAG GAGGTTACAC CTGCTCCTGC ACCGATGGGT ACTGGCTTCT GGAAGGGCAG 4 20 
TGCCTAGATA TTGATGAATG TCGCTATGGT TACTGCCAGC AGCTCTGTGC AAATGTTCCA 4 80 
GGATCCTATT CCTGTACATG CAACCCTGGT TTCACCCTCA ACGACGATGG AAGGTCTTGC 54 0 
CAAGATGTGA ACGAGTGCGA AACTGAGAAT CCCTGTGTTC AGACCTGTGT CAACACCTAT 600 
GGCTCTTTCA TCTGCCGCTG TGACCCAGGA TATGAACTTG AGGAAGATGG CATTCACTGC 660 
AGTGATATGG ACGAGTGCAG CTTCTCCGAG TTCCTCTGTC AACACGAGTG TGTGAACCAG 72 0 
CCGGGCTCAT ACTTCTGCTC GTGCCCTCCA GGCTACGTCC TGTTGGATGA TAACCGAAGC 7 80 
TGCCAGGATA TCAATGAATG TGAGCACCGA AACCACACGT GTACCTCACT GCAGACTTGC 84 0 
TACAATCTAC AAGGGGGCTT CAAATGTATT GATCCCATCA GCTGTGAGGA GCCTTATCTG 900 
CTGATTGGTG AAAACCGCTG TATGTGTCCT GCTGAGCACA CCAGCTGCAG AGACCAGCCA 960 
TTCACCATCC TGTATCGGGA CATGGATGTG GTGTCAGGAC GCTCCGTTCC TGCTGACATC 102 0 
TTCCAGATGC AAGCAACAAC CCGATACCCT GGTGCCTATT ACATTTTCCA GATCAAATCT 108 0 
GGCAACGAGG GTCGAGAGTT CTATATGCGG CAAACAGGGC CTATCAGTGC CACCCTGGTG 114 0 
ATGACACGCC CCATCAAAGG GCCTCGGGAC ATCCAGCTGG ACTTGGAGAT GATCACTGTC 1200 
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AACACTGTCA TCAACTTCAG AGGCAGCTCC GTGATCCGAC TGCGGATATA TGTGTCGCAG 1260 

TATCCGTTC 12 69 

SEQ ID NO. : 6 
Length : 461 amino acids 
Type : amino acid 
Topology : liner 
Molecule type : protein 
Sequence Description : 

Met Gly Pro Arg Ser Phe Glu Pro Met His Ser Gly Leu Cys Arg Gin 
-36 -35 -30 -25 

Arg Arg Met He Leu Thr Val Thr He Leu Ala Leu Trp Leu Pro His 
-20 -15 -10 -5 

Pro Gly Asn Ala Gin Gin Gin Cys Thr Asn Gly Phe Asp Leu Asp Arg 
1 5 10 

Gin Ser Gly Gin Cys Leu Asp He Asp Glu Cys Arg Thr He Pro Glu 
15 20 25 

Ala Cys Arg Gly Asp Met Met Cys Val Asn Gin Asn Gly Gly Tyr Leu 
30 35 40 

Cys lie Pro Arg Thr Asn Pro Val Tyr Arg Gly Pro Tyr Ser Asn Pre 
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45 50 55 60 

Tyr Ser Thr Ser Tyr Ser Gly Pro Tyr Pro Ala Ala Ala Pro Pro Val 
65 70 75 

Pro Ala Ser Asn Tyr Pro Thr He Ser Arg Pro Leu Val Cys Arg Phe 
80 85 90 

Gly Tyr Gin Met Asp Glu Gly Asn Gin Cys Val Asp Val Asp Glu Cys 
95 100 105 

Ala Thr Asp Ser His Gin Cys Asn Pro Thr Gin He Cys He Asn Thr 
HO 115 120 

Glu Gly Gly Tyr Thr Cys Ser Cys Thr Asp Gly Tyr Trp Leu Leu Glu 
125 130 135 140 

Gly Gin Cys Leu Asp He Asp Glu Cys Arg Tyr Gly Tyr Cys Gin Gin 
145 150 155 

Leu Cys Ala Asn Val Pro Gly Ser Tyr Ser Cys Thr Cys Asn Pro Gly 
160 165 170 

Phe Thr Leu Asn Asp Asp Gly Arg Ser Cys Gin Asp Val Asn Glu Cys 
175 180 185 

Glu Thr Glu Asn Pro Cys Val Gin Thr Cys Val Asn Thr Tyr Gly Ser 
190 195 200 
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Phe lie Cys Arg Cys Asp Pro Gly Tyr Glu Leu Glu Glu Asp Gly lie 
205 210 215 220 

His Cys Ser Asp Met Asp Glu Cys Ser Phe Ser Glu Phe Leu Cys Gin 
225 230 235 

His Glu Cys Val Asn Gin Pro Gly Ser Tyr Phe Cys Ser Cys Pro Pro 
240 245 250 

Gly Tyr Val Leu Leu Asp Asp Asn Arg Ser Cys Gin Asp lie Asn Glu 
255 260 265 

Cys Glu His Arg Asn His Thr Cys Thr Ser Leu Gin Thr Cys Tyr Asn 
270 275 280 

Leu Gin Gly Gly Phe Lys Cys lie Asp Pro lie Ser Cys Glu Glu Pro 
285 290 295 300 

Tyr Leu Leu lie Gly Glu Asn Arg Cys Met Cys Pro Ala Glu His Thr 
305 310 315 

Ser Cys Arg Asp Gin Pro Phe Thr He Leu Tyr Arg Asp Met Asp Val 
320 325 330 

Val Ser Gly Arg Ser Val Pro Ala Asp He Phe Gin Met Gin Ala Thr 
335 340 345 
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Thr Arg Tyr Pro Gly Ala Tyr Tyr He Phe Gin He Lys Ser Gly Asn 
350 355 360 

Glu Gly Arg Glu Phe Tyr Met Arg Gin Thr Gly Pro He Ser Ala Thr 
365 370 375 380 

Leu Val Met Thr Arg Pro He Lys Gly Pro Arg Asp lie Gin Leu Asp 
385 390 395 

Leu Glu Met lie Thr Val Asn Thr Val lie Asn Phe Arg Gly Ser Ser 
400 405 410 

Val lie Arg Leu Arg lie Tyr Val Ser Gin Tyr Pro Phe 
415 420 425 

SEQ ID NO. : 7 

Length : 1383 base pairs 

Type : nucleic acid 

Strandness : single 

Topology : liner 

Molecule type : cDNA to mRNA 

Sequence Description: 

ATGGGACCTA GAAGTTTCGA GCC AATGCAC AGTGGACTCT GCAGACAGAG ACGCATGATA 60 
CTCACTGTTA CCATCTTGGC ACTCTGGCTT CCACATCCTG GGAATGCACA GCAGCAGTGC 120 
ACAAACGGCT TTGACCTGGA CCGCCAGTCA GGACAGTGTC TAGATATTGA TGAATGCCGG 180 



ACCATCCCTG AGGCTTGTCG TGGGGACATG ATGTGTGTCA ACCAGAATGG CGGGTATTTG 24 0 



TGCATCCCTC GAACCAACCC AGTGTATCGA GGGCCTTACT CAAATCCCTA CTCTACATCC 3 00 



TACTCAGGCC CATACCCAGC AGCGGCCCCA CCAGTACCAG CTTCCAACTA CCCCACGATT 3 60 



TCAAGGCCTC TTGTCTGCCG CTTTGGGTAT CAGATGGATG AAGGCAACCA GTGTGTGGAT 42 0 



GTGGACGAGT GTGCAACAGA CTCACACCAG TGCAACCCTA CCCAGATCTG TATCAACACT 4 80 



GAAGGAGGTT ACACCTGCTC CTGCACCGAT GGGTACTGGC TTCTGGAAGG GCAGTGCCTA 54 0 



GATATTGATG AATGTCGCTA TGGTTACTGC CAGCAGCTCT GTGCAAATGT TCCAGGATCC 600 



TATTCCTGTA CATGCAACCC TGGTTTCACC CTCAACGACG ATGGAAGGTC TTGCCAAGAT 660 



GTGAACGAGT GCGAAACTGA GAATCCCTGT GTTCAGACCT GTGTCAACAC CTATGGCTCT 72 0 



TTCATCTGCC GCTGTGACCC AGGATATGAA CTTGAGGAAG ATGGCATTCA CTGCAGTGAT 7 80 



ATGGACGAGT GCAGCTTCTC CGAGTTCCTC TGTCAACACG AGTGTGTGAA CCAGCCGGGC 84 0 



TCATACTTCT GCTCGTGCCC TCCAGGCTAC GTCCTGTTGG ATGATAACCG AAGCTGCCAG 900 



GATATC AATG AATGTGAGC A CCGAAACCAC ACGTGTACCT CACTGCAGAC TTGCTACAAT 9 60 



CTAC AAGGGG GCTTCAAATG TATTGATCCC ATCAGCTGTG AGGAGCCTTA TCTGCTGATT 102 0 
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GGTGAAAACC GCTGTATGTG TCCTGCTGAG CACACCAGCT GCAGAGACCA GCCATTCACC 1080 

ATCCTGTATC GGGACATGGA TGTGGTGTCA GGACGCTCCG TTCCTGCTGA CATCTTCCAG 114 0 

ATGCAAGCAA CAACCCGATA CCCTGGTGCC TATTACATTT TCCAGATCAA ATCTGGCAAC 1200 

GAGGGTCGAG AGTTCTATAT GCGGCAAACA GGGCCTATCA GTGCCACCCT GGTGATGACA 1260 

CGCCCCATCA AAGGGCCTCG GGACATCCAG CTGGACTTGG AGATGATCAC TGTCAACACT 1320 

GTCATCAACT TCAGAGGCAG CTCCGTGATC CGACTGCGGA TATATGTGTC GCAGTATCCG 1380 

TTC 1383 

SEQ ID NO. : 8 

Length : 2429 base pairs 

Type : nucleic acid 

Strandness : single 

Topology : liner 

Molecule type : cDNA to mRNA 

Original source 

Organism : Mus Musculus 

Organelle : dayl3 embryonic heart 

Clone Name : mouse A5 5b 
Feature 

Name/Key : CDS 

Location : 232 .. 1 6 1 4 
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Identification method : S 
Name/Key : sig peptide 
Location : 232 . . 339 
Identification method : S 
Name/ Key : mat peptide 
Location : 340 .. 1 614 
Identification method : S 
Sequence Description: 

CAGCATCTCG AGAGAGGCAG CAGACAACCT CTCTAGGTCA TTTCTCTTTC TTTTTGGAAA 6 0 

GGGCAGCAAC GTTGTGCGCA GTTTATAAAA TATCACACTA CATGTTTTTT AAATTTGGGA 120 

GACTGCTGAC TACGGCACCA GCAATTGCTT TGCTGCGACG GCTGTGAGAC AAGCAGAAGT 180 

CTCCGAACAC TTCTGTCTGC GTTTGCTCTA TGTGTGTGAT TTACAGAGGG A ATG GGA 237 

Met Gly 
-36 -35 

CCT AGA AGT TTC GAG CCA ATG CAC AGT GGA CTC TGC AGA CAG AGA CGC 2 85 

Pro Arg Ser Phe Glu Pro Met His Ser Gly Leu Cys Arg Gin Arg Arg 



-30 



-25 



-20 



ATG ATA CTC ACT GTT ACC ATC TTG GCA CTC TGG CTT CCA CAT CCT GGG 



333 



Met lie Leu Thr Val Thr lie Leu Ala Leu Trp Leu Pro His Pro Gly 



- 1 0 



AAT GCA CAG CAG CAG TGC AC A AAC GGC TTT GAC CTG GAC CGC CAG TCA 



38 1 
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Asn Ala Gin Gin Gin Cys Thr Asn Gly Phe Asp Leu Asp Arg Gin Ser 
1 5 10 

GGA CAG TGT CTA GAT ATT GAT GAA TGC CGG ACC ATC CCT GAG GCT TGT 429 
Gly Gin Cys Leu Asp lie Asp Glu Cys Arg Thr lie Pro Glu Ala Cys 
15 2 0 25 30 

CGT GGG GAC ATG ATG TGT GTC AAC CAG AAT GGC GGG TAT TTG TGC ATC 477 
Arg Gly Asp Met Met Cys Val Asn Gin Asn Gly Gly Tyr Leu Cys He 
35 40 45 

CCT CGA ACC AAC CCA GTG TAT CGA GGG CCT TAC TCA AAT CCC TAC TCT 525 
Pro Arg Thr Asn Pro Val Tyr Arg Gly Pro Tyr Ser Asn Pro Tyr Ser 
50 55 60 

ACA TCC TAC TCA GGC CCA TAC CCA GCA GCG GCC CCA CCA GTA CCA GCT 573 
Thr Ser Tyr Ser Gly Pro Tyr Pro Ala Ala Ala Pro Pro Val Pro Ala 
65 70 75 

TCC AAC TAC CCC ACG ATT TCA AGG CCT CTT GTC TGC CGC TTT GGG TAT 62 1 

Ser Asn Tyr Pro Thr He Ser Arg Pro Leu Val Cys Arg Phe Gly Tyr 
80 85 90 

CAG ATG GAT GAA GGC AAC CAG TGT GTG GAT GTG GAC GAG TGT GCA ACA 669 
Gin Met Asp Glu Gly Asn Gin Cys Val Asp Val Asp Glu Cys Ala Thr 
9 5 100 105 110 



GAC TCA CAC CAG TGC AAC CCT ACC CAG ATC TGT ATC AAC ACT GAA GGA 
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Asp Ser His Gin Cys Asn Pro Thr Gin lie Cys lie Asn Thr Glu Gly 
115 120 125 

GGT TAC ACC TGC TCC TGC ACC GAT GGG TAC TGG CTT CTG GAA GGG CAG 7 65 

Gly Tyr Thr Cys Ser Cys Thr Asp Gly Tyr Trp Leu Leu Glu Gly Gin 
130 135 140 

TGC CTA GAT ATT GAT GAA TGT CGC TAT GGT TAC TGC CAG CAG CTC TGT 813 
Cys Leu Asp He Asp Glu Cys Arg Tyr Gly Tyr Cys Gin Gin Leu Cys 
145 150 155 

GCA AAT GTT CCA GGA TCC TAT TCC TGT ACA TGC AAC CCT GGT TTC ACC 861 
Ala Asn Val Pro Gly Ser Tyr Ser Cys Thr Cys Asn Pro Gly Phe Thr 
160 165 170 

CTC AAC GAC GAT GGA AGG TCT TGC CAA GAT GTG AAC GAG TGC GAA ACT 9 09 

Leu Asn Asp Asp Gly Arg Ser Cys Gin Asp Val Asn Glu Cys Glu Thr 
175 180 185 190 

GAG AAT CCC TGT GTT CAG ACC TGT GTC AAC ACC TAT GGC TCT TTC ATC 95 7 

Glu Asn Pro Cys Val Gin Thr Cys Val Asn Thr Tyr Gly Ser Phe He 
195 200 205 

TGC CGC TGT GAC CCA GGA TAT GAA CTT GAG GAA GAT GGC ATT CAC TGC 1005 
Cys Arg Cys Asp Pro Gly Tyr Glu Leu Glu Glu Asp Gly He His Cys 
210 215 220 



AGT GAT ATG GAC GAG TGC AGC TTC TCC GAG TTC CTC TGT CAA CAC GAG 
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Ser Asp Met Asp Glu Cys Ser Phe Ser Glu Phe Leu Cys Gin His Glu 
225 230 235 

TGT GTG AAC CAG CCG GGC TCA TAC TTC TGC TCG TGC CCT CCA GGC TAC 1101 
Cys Val Asn Gin Pro Gly Ser Tyr Phe Cys Ser Cys Pro Pro Gly Tyr 
240 245 250 

GTC CTG TTG GAT GAT AAC CGA AGC TGC CAG GAT ATC AAT GAA TGT GAG 1149 
Val Leu Leu Asp Asp Asn Arg Ser Cys Gin Asp lie Asn Glu Cys Glu 
255 260 265 270 

CAC CGA AAC CAC ACG TGT ACC TCA CTG CAG ACT TGC TAC AAT CTA CAA 1197 
His Arg Asn His Thr Cys Thr Ser Leu Gin Thr Cys Tyr Asn Leu Gin 
275 280 285 

GGG GGC TTC AAA TGT ATT GAT CCC ATC AGC TGT GAG GAG CCT TAT CTG 12 45 
Gly Gly Phe Lys Cys lie Asp Pro lie Ser Cys Glu Glu Pro Tyr Leu 
290 295 300 

CTG ATT GGT GAA AAC CGC TGT ATG TGT CCT GCT GAG CAC ACC AGC TGC 12 93 
Leu He Gly Glu Asn Arg Cys Met Cys Pro Ala Glu His Thr Ser Cys 
305 310 315 

AGA GAC CAG CCA TTC ACC ATC CTG TAT CGG GAC ATG GAT GTG GTG TCA 134 1 
Arg Asp Gin Pro Phe Thr He Leu Tyr Arg Asp Met Asp Val Val Ser 
320 325 330 

GGA CGC TCC GTT CCT GCT GAC ATC TTC CAG ATG CAA GCA AC A ACC CGA 13 89 



Gly Arg Ser Val Pro Ala Asp lie Phe Gin Met Gin Ala Thr Thr Arg 
335 340 345 350 

TAC CCT GGT GCC TAT TAC ATT TTC CAG ATC AAA TCT GGC AAC GAG GGT 1437 

Tyr Pro Gly Ala Tyr Tyr lie Phe Gin lie Lys Ser Gly Asn Glu Gly 
355 360 365 

CGA GAG TTC TAT ATG CGG CAA ACA GGG CCT ATC AGT GCC ACC CTG GTG 14 85 
Arg Glu Phe Tyr Met Arg Gin Thr Gly Pro lie Ser Ala Thr Leu Val 
370 375 380 

ATG ACA CGC CCC ATC AAA GGG CCT CGG GAC ATC CAG CTG GAC TTG GAG 153 3 
Met Thr Arg Pro lie Lys Gly Pro Arg Asp lie Gin Leu Asp Leu Glu 
385 390 395 

ATG ATC ACT GTC AAC ACT GTC ATC AAC TTC AGA GGC AGC TCC GTG ATC 158 1 
Met He Thr Val Asn Thr Val He Asn Phe Arg Gly Ser Ser Val He 
400 405 410 

CGA CTG CGG ATA TAT GTG TCG CAG TAT CCG TTC TGAGCCTCTG GCTAAGGCCT 1634 
Arg Leu Arg He Tyr Val Ser Gin Tyr Pro Phe 
415 420 425 

CTGACACTGC CTTTCACC AG CACCGAGGGA CGGGAGGAGA AAGGAAACCA GCAAGAATGA 169 4 

GAGCGAGAC A GAC ATTGCAC CTTTCCTGCT GAATATCTCC TGGGGGC ATC AGCCTAGC AT 17 54 

CTTGACCC AT ATCTGTACTA TTGC AGATGG TCACTCTGAA GGAC ACCCTG CCCTCAGTTC 18 14 



CTATGATGCA GTTATCCAAA AGTGTTCATC 

CTTCAAAGCC TTCCATTTAT TTCCATCGTT 

GGGGTATGAG TCCTCGAAGG TTCAAAAGAC 

TTCCTCCATC TCTTGCTGCA TTGCTGCTTT 

CTGGGAATAG CTAGTTTGCT TCTTGCATGT 

AGGATCGAAG GTTTTTATAG AGTCTATTTT 

GAAATTTTAG TTGTCTTTAA AATTTGTATG 

GCTTCTTAAA GTGGTAGAAT TCCTTCCAAA 

CCAACCTCAT CCTTTCCTGC ATCTTAGCCC 

TNTTAAGAGT TTTTACCCAA CTGCGTTGGA 

TGAAGAAAAA AAAAA 

SEQ ID NO. : 9 

Length : 423 amino acids 
Type : amino acid 
Topology : liner 
Molecule type : protein 



TTAGCCCCTG ATATGAGGTT GCCAGTGACT 187 4 



TTATAAAAAA GAAAATAGAT TAGATTTGCT 19 34 



TGAGTGGCTT GCTCTCACCT CTTCCTCTCC 1994 



GCAAAAGTCC TCATGGGCTC GTGGGAAATG 2054 



TCTGAGAAGG CTATGGGAAC ACACCACAGC 2114 



AAAATCACAT CTGGTATTTT CAGCATAAAA 2 174 



AGTGTTTAAC CTTTTCTTAT TCATTTTGAG 2 23 4 



GGCCTCAGAT ACATGTTATG TTCAGTCTTT 22 9 4 



AGTTTTTACG AAGACCCCTT AATCATGCTT 2354 



AGACAGAGGT ATCCAGACTG ATTAAATAAT 24 14 




Sequence Description : 

Gin Cys Thr Asn Gly Phe Asp Leu Asp Arg Gin Ser Gly Gin Cys Leu 
1 5 10 15 

Asp He Asp Glu Cys Arg Thr He Pro Glu Ala Cys Arg Gly Asp Met 
20 25 30 

Met Cys Val Asn Gin Asn Gly Gly Tyr Leu Cys He Pro Arg Thr Asn 
35 40 45 

Pro Val Tyr Arg Gly Pro Tyr Ser Asn Pro Tyr Ser Thr Ser Tyr Ser 
50 55 60 

Gly Pro Tyr Pro Ala Ala Ala Pro Pro Val Pro Ala Ser Asn Tyr Pro 
65 70 75 80 

Thr He Ser Arg Pro Leu Val Cys Arg Phe Gly Tyr Gin Met Asp Glu 
85 90 95 

Gly Asn Gin Cys Val Asp Val Asp Glu Cys Ala Thr Asp Ser His Gin 
100 105 110 

Cys Asn Pro Thr Gin He Cys He Asn Thr Glu Gly Gly Tyr Thr Cys 
115 120 125 

Ser Cys Thr Asp Gly Tyr Trp Leu Leu Glu Gly Gin Cys Leu Asp He 
-30 135 140 



Asp Glu Cys Arg Tyr Gly Tyr Cys Gin Gin Leu Cys Ala Asn Val Pro 
145 150 155 160 



Gly Ser Tyr Ser Cys Thr Cys Asn Pro Gly Phe Thr Leu Asn Asp Asp 
165 170 175 

Gly Arg Ser Cys Gin Asp Val Asn Glu Cys Glu Thr Glu Asn Pro Cys 
180 185 190 

Val Gin Thr Cys Val Asn Thr Tyr Gly Ser Phe He Cys Arg Cys Asp 
195 200 205 

Pro Gly Tyr Glu Leu Glu Glu Asp Gly He His Cys Ser Asp Met Asp 
210 215 220 

Glu Cys Ser Phe Ser Glu Phe Leu Cys Gin His Glu Cys Val Asn Gin 
225 230 235 240 

Pro Gly Ser Tyr Phe Cys Ser Cys Pro Pro Gly Tyr Val Leu Leu Asp 
245 250 255 

Asp Asn Arg Ser Cys Gin Asp He Asn Glu Cys Glu His Arg Asn His 
260 265 270 

Thr Cys Thr Ser Leu Gin Thr Cys Tyr Asn Leu Gin Gly Gly Phe Lys 
275 280 285 




Cys lie Asp Pro lie Ser Cys Glu Glu Pro Tyr Leu Leu lie Gly Glu 
290 295 300 

Asn Arg Cys Met Cys Pro Ala Glu His Thr Ser Cys Arg Asp Gin Pro 
305 310 315 320 

Phe Thr lie Leu Tyr Arg Asp Met Asp Val Val Ser Gly Arg Ser Val 
325 330 335 

Pro Ala Asp lie Phe Gin Met Gin Ala Thr Thr Arg Tyr Pro Gly Ala 
340 345 350 

Tyr Tyr lie Phe Gin lie Lys Ser Gly Asn Glu Gly Arg Glu Phe Tyr 
355 360 365 

Met Arg Gin Thr Gly Pro He Ser Ala Thr Leu Val Met Thr Arg Pro 
370 375 380 

He Lys Gly Pro Arg Asp He Gin Leu Asp Leu Glu Met He Thr Val 
385 390 395 400 

Asn Thr Val He Asn Phe Arg Gly Ser Ser Val He Arg Leu Arg He 
405 4 10 4 15 

Tyr Val Ser Gin Tyr Pro Phe 
420 

SEQ ID NO. : 10 

7 6 



# 



Length : 1269 base pairs 
Type : nucleic acid 
Strandness : single 
Topology : liner 
Molecule type : cDNA to mRNA 
Sequence Description : 

CAGTGCACAA ACGGCTTTGA CCTGGACCGC CAGTCAGGAC AGTGTCTAGA TATTGATGAA 60 

TGCCGGACCA TCCCTGAGGC TTGTCGTGGG GACATGATGT GTGTCAACCA GAATGGCGGG 12 0 

TATTTGTGCA TCCCTCGAAC CAACCCAGTG TATCGAGGGC CTTACTCAAA TCCCTACTCT 180 

ACATCCTACT CAGGCCCATA CCCAGCAGCG GCCCCACCAG TACCAGCTTC CAACTACCCC 24 0 

ACGATTTCAA GGCCTCTTGT CTGCCGCTTT GGGTATCAGA TGGATGAAGG CAACCAGTGT 3 00 

GTGGATGTGG ACGAGTGTGC AACAGACTCA CACCAGTGCA ACCCTACCCA GATCTGTATC 3 60 

AACACTGAAG GAGGTTACAC CTGCTCCTGC ACCGATGGGT ACTGGCTTCT GGAAGGGCAG 42 0 

TGCCTAGATA TTGATGAATG TCGCTATGGT TACTGCCAGC AGCTCTGTGC AAATGTTCCA 4 80 

GGATCCTATT CCTGTACATG CAACCCTGGT TTCACCCTCA ACGACGATGG AAGGTCTTGC 54 0 

CAAGATGTGA ACGAGTGCGA AACTGAGAAT CCCTGTGTTC AGACCTGTGT CAACACCTAT 6 00 

GGCTCTTTCA TCTGCCGCTG TGACCC AGGA TATGAACTTG AGGAAGATGG CATTCACTGC 6 60 
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AGTGATATGG ACGAGTGCAG CTTCTCCGAG TTCCTCTGTC AACACGAGTG TGTGAACCAG 72 0 

CCGGGCTCAT ACTTCTGCTC GTGCCCTCCA GGCTACGTCC TGTTGGATGA TAACCGAAGC 780 

TGCCAGGATA TCAATGAATG TGAGCACCGA AACCACACGT GTACCTCACT GCAGACTTGC 84 0 

TACAATCTAC AAGGGGGCTT CAAATGTATT GATCCCATCA GCTGTGAGGA GCCTTATCTG 9 00 

CTGATTGGTG AAAACCGCTG TATGTGTCCT GCTGAGCACA CCAGCTGCAG AGACCAGCCA 960 

TTCACCATCC TGTATCGGGA CATGGATGTG GTGTCAGGAC GCTCCGTTCC TGCTGACATC 102 0 

TTCCAGATGC AAGCAACAAC CCGATACCCT GGTGCCTATT ACATTTTCCA GATCAAATCT 1080 

GGCAACGAGG GTCGAGAGTT CTATATGCGG CAAACAGGGC CTATCAGTGC CACCCTGGTG 1140 

ATGACACGCC CCATCAAAGG GCCTCGGGAC ATCCAGCTGG ACTTGGAGAT GATCACTGTC 12 00 

AACACTGTCA TCAACTTCAG AGGCAGCTCC GTGATCCGAC TGCGGATATA TGTGTCGCAG 12 60 

TATCCGTTC 12 69 

SEQ ID NO. : 11 
Length : 448 amino acids 
Type : amino acid 
Topology : liner 
Molecule type : protein 




Sequence Description : 

Met Pro Gly He Lys Arg He Leu Thr Val Thr He Leu Ala Leu Cys 
-23 -20 -15 -10 

Leu Pro Ser Pro Gly Asn Ala Gin Ala Gin Cys Thr Asn Gly Phe Asp 
-5 1 5 

Leu Asp Arg Gin Ser Gly Gin Cys Leu Asp He Asp Glu Cys Arg Thr 
10 15 20 25 

He Pro Glu Ala Cys Arg Gly Asp Met Met Cys Val Asn Gin Asn Gly 
30 35 40 

Gly Tyr Leu Cys He Pro Arg Thr Asn Pro Val Tyr Arg Gly Pro Tyr 
45 50 55 

Ser Asn Pro Tyr Ser Thr Pro Tyr Ser Gly Pro Tyr Pro Ala Ala Ala 
60 65 70 

Pro Pro Leu Ser Ala Pro Asn Tyr Pro Thr He Ser Arg Pro Leu He 
75 80 85 

Cys Arg Phe Gly Tyr Gin Met Asp Glu Ser Asn Gin Cys Val Asp Val 
90 95 100 105 

Asp Glu Cys Ala Thr Asp Ser His Gin Cys Asn Pro Thr Gin He Cys 
110 115 120 
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He Asn Thr Glu Gly Gly Tyr Thr Cys Ser Cys Thr Asp Gly Tyr Trp 
125 130 135 

Leu Leu Glu Gly Gin Cys Leu Asp He Asp Glu Cys Arg Tyr Gly Tyr 
140 145 150 

Cys Gin Gin Leu Cys Ala Asn Val Pro Gly Ser Tyr Ser Cys Thr Cys 
155 160 165 

Asn Pro Gly Phe Thr Leu Asn Glu Asp Gly Arg Ser Cys Gin Asp Val 
1^0 175 180 185 

Asn Glu Cys Ala Thr Glu Asn Pro Cys Val Gin Thr Cys Val Asn Thr 
190 195 200 

Tyr Gly Ser Phe He Cys Arg Cys Asp Pro Gly Tyr Glu Leu Glu Glu 
205 210 215 

Asp Gly Val His Cys Ser Asp Met Asp Glu Cys Ser Phe Ser Glu Phe 
220 225 230 

Leu Cys Gin His Glu Cys Val Asn Gin Pro Gly Thr Tyr Phe Cys Ser 
235 240 245 

Cys Pro Pro Gly Tyr He Leu Leu Asp Asp Asn Arg Ser Cys Gin Asp 
250 255 260 265 
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He Asn Glu Cys Glu His Arg Asn His Thr Cys Asn Leu Gin Gin Thr 
270 275 280 

Cys Tyr Asn Leu Gin Gly Gly Phe Lys Cys He Asp Pro He Arg Cys 
285 290 295 

Glu Glu Pro Tyr Leu Arg He Ser Asp Asn Arg Cys Met Cys Pro Ala 
300 305 310 

Glu Asn Pro Gly Cys Arg Asp Gin Pro Phe Thr He Leu Tyr Arg Asp 
3 15 320 325 

Met Asp Val Val Ser Gly Arg Ser Val Pro Ala Asp He Phe Gin Met 
330 335 340 345 

Gin Ala Thr Thr Arg Tyr Pro Gly Ala Tyr Tyr He Phe Gin He Lys 
350 355 360 

Ser Gly Asn Glu Gly Arg Glu Phe Tyr Met Arg Gin Thr Gly Pro He 
365 370 375 

Ser Ala Thr Leu Val Met Thr Arg Pro He Lys Gly Pro Arg Glu He 
380 385 390 

Gin Leu Asp Leu Glu Met He Thr Val Asn Thr Val He Asn Phe Arg 
395 400 405 

Gly Ser Ser Val lie Arg Leu Arg lie Tyr Val Ser Gin Tyr Pro Phe 




41 ° 415 420 425 

SEQ ID NO. : 12 

Length : 1344 base pairs 

Type : nucleic acid 

Strandness : single 

Topology : liner 

Molecule type : cDNA to mRNA 

Sequence Description : 

ATGCCAGGAA TAAAAAGGAT ACTCACTGTT ACCATTCTGG CTCTCTGTCT TCCAAGCCCT 6 0 
GGGAATGCAC AGGCACAGTG CACGAATGGC TTTGACCTGG ATCGCCAGTC AGGACAGTGT 120 
TTAGATATTG ATGAATGCCG AACCATCCCC GAGGCCTGCC GAGGAGACAT GATGTGTGTT 180 
AACCAAAATG GCGGGTATTT ATGCATTCCC CGGACAAACC CTGTGTATCG AGGGCCCTAC 24 0 
TCGAACCCCT ACTCGACCCC CTACTCAGGT CCGTACCCAG CAGCTGCCCC ACCACTCTCA 300 
GCTCCAAACT ATCCCACGAT CTCCAGGCCT CTTATATGCC GCTTTGGATA CCAGATGGAT 36 0 
GAAAGCAACC AATGTGTGGA TGTGGACGAG TGTGCAACAG ATTCCCACCA GTGCAACCCC 4 20 
ACCCAGATCT GCATC AATAC TGAAGGCGGG TACACCTGCT CCTGCACCGA CGGATATTGG 480 
CTTCTGGAAG GCC AGTGCTT AG AC ATT GAT GAATGTCGCT ATGGTTACTG CCAGCAGCTC 54 0 
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TGTGCGAATG TTCCTGGATC CTATTCTTGT ACATGCAACC CTGGTTTTAC CCTCAATGAG 600 
GATGGAAGGT CTTGCCAAGA TGTGAACGAG TGTGCCACCG AGAACCCCTG CGTGCAAACC 660 



TGCGTCAACA CCTACGGCTC TTTCATCTGC CGCTGTGACC CAGGATATGA ACTTGAGGAA 720 

GATGGCGTTC ATTGCAGTGA TATGGACGAG TGCAGCTTCT CTGAGTTCCT CTGCCAACAT 780 

GAGTGTGTGA ACCAGCCCGG CACATACTTC TGCTCCTGCC CTCCAGGCTA CATCCTGCTG 84 0 

GATGACAACC GAAGCTGCCA AGACATCAAC GAATGTGAGC ACAGGAACCA CACGTGCAAC 9 00 

CTGCAGCAGA CGTGCTACAA TTTACAAGGG GGCTTCAAAT GCATCGACCC CATCCGCTGT 960 

GAGGAGCCTT ATCTGAGGAT CAGTGATAAC CGCTGTATGT GTCCTGCTGA GAACCCTGGC 102 0 



TGCAGAGACC AGCCCTTTAC CATCTTGTAC CGGGACATGG ACGTGGTGTC AGGACGCTCC 1080 
GTTCCCGCTG ACATCTTCCA AATGCAAGCC ACGACCCGCT ACCCTGGGGC CTATTACATT 114 0 
TTCCAGATCA AATCTGGGAA TGAGGGCAGA GAATTTTACA TGCGGCAAAC GGGCCCCATC 12 00 



AGTGCCACCC TGGTGATGAC ACGCCCCATC AAAGGGCCCC GGGAAATCCA GCTGGACTTG 12 60 



GAAATGATCA CTGTCAACAC TGTCATCAAC TTC AGAGGC A GCTCCGTGAT CCGACTGCGG 13 20 



ATATATGTGT CGCAGTACCC ATTC 



SEQ ID NO. : 13 
Length : 2328 base pairs 
Type : nucleic acid 
Strandness : single 
Topology : liner 
Molecule type : cDNA to mRNA 
Original source 
Organism : Homo Sapiens 
Organelle : human brain 
Clone Name : human A55 
Feature 
Name/Key : CDS 
Location : 169 . . 1512 
Identification method : S 
Name/Key : sig peptide 
Location : 1 69 237 
Identification method : S 
Name/ Key : mat peptide 
Location : 2 38 .. 15 12 
Identification method : S 
Sequence Description : 

GACCCGGCGC TCTCCCCGTG TCCTCTCCAC GACTCGCTCG GCCCCTCTGG AATAAAACAC 60 

CCGCGAGCCC CGAGGGCCCA GAGGAGGCCG ACGTGCCCGA GCTCCTCCGG GGGTCCCGCC 120 

CGCGAGCTTT CTTCTCGCCT TCGCATCTCC TCCTCGCGCG TCTTGGAC ATG CCA GGA 177 

Met Pro Gly 



84 



-23 



ATA AAA AGG ATA CTC ACT GTT ACC ATT CTG GCT CTC TGT CTT CCA AGC 225 
lie Lys Arg lie Leu Thr Val Thr lie Leu Ala Leu Cys Leu Pro Ser 
-20 -15 -10 -5 



CCT GGG AAT GCA CAG GCA CAG TGC ACG AAT GGC TTT GAC CTG GAT CGC 27 3 

Pro Gly Asn Ala Gin Ala Gin Cys Thr Asn Gly Phe Asp Leu Asp Arg 
1 5 10 



CAG TCA GGA CAG TGT TTA GAT ATT GAT GAA TGC CGA ACC ATC CCC GAG 32 1 

Gin Ser Gly Gin Cys Leu Asp lie Asp Glu Cys Arg Thr lie Pro Glu 
15 20 25 



GCC TGC CGA GGA GAC ATG ATG TGT GTT AAC CAA AAT GGC GGG TAT TTA 3 69 

Ala Cys Arg Gly Asp Met Met Cys Val Asn Gin Asn Gly Gly Tyr Leu 
30 35 40 



TGC ATT CCC CGG ACA AAC CCT GTG TAT CGA GGG CCC TAC TCG AAC CCC 4 17 

Cys lie Pro Arg Thr Asn Pro Val Tyr Arg Gly Pro Tyr Ser Asn Pro 
45 50 55 60 



TAC TCG ACC CCC TAC TCA GGT CCG TAC CCA GCA GCT GCC CCA CCA CTC 4 65 

Tyr Ser Thr Pro Tyr Ser Gly Pro Tyr Pro Ala Ala Ala Pro Pro Leu 
65 70 75 



TCA GCT CCA AAC TAT CCC ACG ATC TCC AGG CCT CTT ATA TGC CGC TTT 5 13 

Ser Ala Pro Asn Tyr Pro Thr lie Ser Arg Pro Leu lie Cys Arg Phe 
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80 85 90 

GGA TAC CAG ATG GAT GAA AGC AAC CAA TGT GTG GAT GTG GAC GAG TGT 561 
Gly Tyr Gin Met Asp Glu Ser Asn Gin Cys Val Asp Val Asp Glu Cys 
95 100 105 

GCA ACA GAT TCC CAC CAG TGC AAC CCC ACC CAG ATC TGC ATC AAT ACT 6 09 

Ala Thr Asp Ser His Gin Cys Asn Pro Thr Gin lie Cys lie Asn Thr 
110 115 120 

GAA GGC GGG TAC ACC TGC TCC TGC ACC GAC GGA TAT TGG CTT CTG GAA 65 7 

Glu Gly Gly Tyr Thr Cys Ser Cys Thr Asp Gly Tyr Trp Leu Leu Glu 
125 130 135 140 

GGC CAG TGC TTA GAC ATT GAT GAA TGT CGC TAT GGT TAC TGC CAG CAG 7 05 

Gly Gin Cys Leu Asp lie Asp Glu Cys Arg Tyr Gly Tyr Cys Gin Gin 
145 150 155 

CTC TGT GCG AAT GTT CCT GGA TCC TAT TCT TGT ACA TGC AAC CCT GGT 753 
Leu Cys Ala Asn Val Pro Gly Ser Tyr Ser Cys Thr Cys Asn Pro Gly 
160 165 170 

TTT ACC CTC AAT GAG GAT GGA AGG TCT TGC CAA GAT GTG AAC GAG TGT 80 1 

Phe Thr Leu Asn Glu Asp Gly Arg Ser Cys Gin Asp Val Asn Glu Cys 
175 180 185 

GCC ACC GAG AAC CCC TGC GTG CAA ACC TGC GTC AAC ACC TAC GGC TCT 84 9 

Ala Thr Glu Asn Pro Cys Val Gin Thr Cys Val Asn Thr Tyr Gly Ser 
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190 195 200 

TTC ATC TGC CGC TGT GAC CCA GGA TAT GAA CTT GAG GAA GAT GGC GTT 89/ 
Phe lie Cys Arg Cys Asp Pro Gly Tyr Glu Leu Glu Glu Asp Gly Val 
205 210 215 220 

CAT TGC AGT GAT ATG GAC GAG TGC AGC TTC TCT GAG TTC CTC TGC CAA 945 
His Cys Ser Asp Met Asp Glu Cys Ser Phe Ser Glu Phe Leu Cys Gin 
225 230 235 

CAT GAG TGT GTG AAC CAG CCC GGC ACA TAC TTC TGC TCC TGC CCT CCA 993 
His Glu Cys Val Asn Gin Pro Gly Thr Tyr Phe Cys Ser Cys Pro Pro 
240 245 250 

GGC TAC ATC CTG CTG GAT GAC AAC CGA AGC TGC CAA GAC ATC AAC GAA 104 1 
Gly Tyr lie Leu Leu Asp Asp Asn Arg Ser Cys Gin Asp lie Asn Glu 
255 260 265 

TGT GAG CAC AGG AAC CAC ACG TGC AAC CTG CAG CAG ACG TGC TAC AAT 1089 
Cys Glu His Arg Asn His Thr Cys Asn Leu Gin Gin Thr Cys Tyr Asn 
270 275 280 

TTA CAA GGG GGC TTC AAA TGC ATC GAC CCC ATC CGC TGT GAG GAG CCT 1137 
Leu Gin Gly Gly Phe Lys Cys lie Asp Pro lie Arg Cys Glu Glu Pro 
285 290 295 300 

TAT CTG AGG ATC AGT GAT AAC CGC TGT ATG TGT CCT GCT GAG AAC CCT 1185 
Tyr Leu Arg lie Ser Asp Asn Arg Cys Met Cys Pro Ala Glu Asn Pro 
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305 3 1 0 3 15 

GGC TGC AGA GAC CAG CCC TTT ACC ATC TTG TAC CGG GAC ATG GAC GTG 1233 
Gly Cys Arg Asp Gin Pro Phe Thr lie Leu Tyr Arg Asp Met Asp Val 
320 325 330 

GTG TCA GGA CGC TCC GTT CCC GCT GAC ATC TTC CAA ATG CAA GCC ACG 1281 
Val Ser Gly Arg Ser Val Pro Ala Asp lie Phe Gin Met Gin Ala Thr 
335 340 345 

ACC CGC TAC CCT GGG GCC TAT TAC ATT TTC CAG ATC AAA TCT GGG AAT 132 9 
Thr Arg Tyr Pro Gly Ala Tyr Tyr He Phe Gin He Lys Ser Gly Asn 
350 355 360 

GAG GGC AGA GAA TTT TAC ATG CGG CAA ACG GGC CCC ATC AGT GCC ACC 137 7 
Glu Gly Arg Glu Phe Tyr Met Arg Gin Thr Gly Pro He Ser Ala Thr 
365 370 375 380 

CTG GTG ATG ACA CGC CCC ATC AAA GGG CCC CGG GAA ATC CAG CTG GAC 1425 
Leu Val Met Thr Arg Pro He Lys Gly Pro Arg Glu He Gin Leu Asp 
385 390 395 

TTG GAA ATG ATC ACT GTC AAC ACT GTC ATC AAC TTC AGA GGC AGC TCC 14 7 3 
Leu Glu Met He Thr Val Asn Thr Val lie Asn Phe Arg Gly Ser Ser 
400 405 410 

GTG ATC CGA CTG CGG ATA TAT GTG TCG CAG TAC CCA TTC TGAGCCTCGG 15 22 

Val lie Arg Leu Arg lie Tyr Val Ser Gin Tyr Pro Phe 
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415 420 425 



GCTGGAGCCT CCGACGCTGC CTCTCATTGG CACCAAGGGA CAGGAGAAGA GAGGAAATAA 15 82 



CAGAGAGAAT GAGAGCGACA CAGACGTTAG GCATTTCCTG CTGAACGTTT CCCCGAAGAG 1642 



TCAGCCCCGA CTTCCTGACT CTCACCTGTA CTATTGCAGA CCTGTCACCC TGCAGGACTT 1702 



GCCACCCCCA GTTCCTATGA TACAGTTATC AAAAAGTATT ATCATTGCTC CCCTGATAGA 17 62 



AGATTGTTGG TGAATTTTCA AGGCCTTCAG TTTATTTCCA CTATTTTCAA AGAAAATAGA 182 2 



TTAGGTTTGC GGGGGTCTGA GTCTATGTTC AAAGACTGTG AACAGCTTGC TGTCACTTCT 1882 



TCACCTCTTC CACTCCTTCT CTCACTGTGT TACTGCTTTG CAAAGACCCG GGAGCTGGCG 1942 



GGGAACCCTG GGAGTAGCTA GTTTGCTTTT TGCGTACACA GAGAAGGCTA TGTAAACAAA 2002 



CCACAGCAGG ATCGAAGGGT TTTTAGAGAA TGTGTTTCAA AACCATGCCT GGTATTTTCA 2 0 62 



ACCATAAAAG AAGTTTCAGT TGTCCTTAAA TTTGTATAAC GGTTTAATTC TGTCTTGTTC 2 122 



ATTTTGAGTA TTTTTAAAAA ATATGTCGTA GAATTCCTTC GAAAGGCCTT CAGACACATG 2 182 



CTATGTTCTG TCTTCCCAAA CCCAGTCTCC TCTCC ATTTT AGCCCAGTGT TTTCTTTGAG 2242 



GACCCCTTAA TCTTGCTTTC TTTAGAATTT TTACCC AATT GGATTGGAAT GCAGAGGTCT 2 3 02 
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CCAAACTGAT TAAATATTTG AAGAGA 2 32 

SEQ ID NO. : 14 
Length : 423 amino acids 
Type : amino acid 
Topology : liner 
Molecule type : protein 
Sequence Description : 

Gin Cys Thr Asn Gly Phe Asp Leu Asp Arg Gin Ser Gly Gin Cys Leu 
15 10 15 

Asp lie Asp Glu Cys Arg Thr lie Pro Glu Ala Cys Arg Gly Asp Met 
20 25 30 

Met Cys Val Asn Gin Asn Gly Gly Tyr Leu Cys lie Pro Arg Thr Asn 
35 40 45 

Pro Val Tyr Arg Gly Pro Tyr Ser Asn Pro Tyr Ser Thr Pro Tyr Ser 
50 55 60 

Gly Pro Tyr Pro Ala Ala Ala Pro Pro Leu Ser Ala Pro Asn Tyr Pro 
65 70 75 80 

Thr lie Ser Arg Pro Leu lie Cys Arg Phe Gly Tyr Gin Met Asp Glu 
85 90 95 

Ser Asn Gin Cys Val Asp Val Asp Glu Cys Ala Thr Asp Ser His Gin 

9 0 
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Cys Asn Pro Thr Gin lie Cys lie Asn Thr Glu Gly Gly Tyr Thr Cys 
115 120 125 

Ser Cys Thr Asp Gly Tyr Trp Leu Leu Glu Gly Gin Cys Leu Asp lie 
130 135 140 

Asp Glu Cys Arg Tyr Gly Tyr Cys Gin Gin Leu Cys Ala Asn Val Pro 
145 150 155 160 

Gly Ser Tyr Ser Cys Thr Cys Asn Pro Gly Phe Thr Leu Asn Glu Asp 
165 170 175 

Gly Arg Ser Cys Gin Asp Val Asn Glu Cys Ala Thr Glu Asn Pro Cys 
180 185 190 

Val Gin Thr Cys Val Asn Thr Tyr Gly Ser Phe He Cys Arg Cys Asp 
195 200 205 

Pro Gly Tyr Glu Leu Glu Glu Asp Gly Val His Cys Ser Asp Met Asp 
210 215 220 

Glu Cys Ser Phe Ser Glu Phe Leu Cys Gin His Glu Cys Val Asn Gin 
225 230 235 240 



Pro Gly Thr Tyr Phe Cys Ser Cys Pro Pro Gly Tyr He Leu Leu Asp 
245 250 255 



Asp Asn Arg Ser Cys Gin Asp He Asn Glu Cys Glu His Arg Asn His 
260 265 270 

Thr Cys Asn Leu Gin Gin Thr Cys Tyr Asn Leu Gin Gly Gly Phe Lys 
275 280 285 

Cys He Asp Pro He Arg Cys Glu Glu Pro Tyr Leu Arg He Ser Asp 
290 295 300 

Asn Arg Cys Met Cys Pro Ala Glu Asn Pro Gly Cys Arg Asp Gin Pro 
305 310 315 320 

Phe Thr He Leu Tyr Arg Asp Met Asp Val Val Ser Gly Arg Ser Val 
325 330 335 

Pro Ala Asp He Phe Gin Met Gin Ala Thr Thr Arg Tyr Pro Gly Ala 
340 345 350 

Tyr Tyr He Phe Gin He Lys Ser Gly Asn Glu Gly Arg Glu Phe Tyr 
355 360 365 

Met Arg Gin Thr Gly Pro lie Ser Ala Thr Leu Val Met Thr Arg Pro 
370 375 380 

He Lys Gly Pro Arg Glu lie Gin Leu Asp Leu Glu Met lie Thr Val 
385 390 395 400 
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Asn Thr Val lie Asn Phe Arg Gly Ser Ser Val lie Arg Leu Arg lie 



405 



410 



415 



Tyr Val Ser Gin Tyr Pro Phe 
420 

SEQ ID NO. : 15 

Length : 1269 base pairs 

Type : nucleic acid 

Strandness : single 

Topology : liner 

Molecule type : cDNA to mRNA 

Sequence Description : 

CAGTGCACGA ATGGCTTTGA CCTGGATCGC CAGTCAGGAC AGTGTTTAGA TATTGATGAA 6 0 
TGCCGAACCA TCCCCGAGGC CTGCCGAGGA GACATGATGT GTGTTAACCA AAATGGCGGG 120 
TATTTATGCA TTCCCCGGAC AAACCCTGTG TATCGAGGGC CCTACTCGAA CCCCTACTCG 180 
ACCCCCTACT CAGGTCCGTA CCCAGCAGCT GCCCCACCAC TCTCAGCTCC AAACTATCCC 24 0 
ACGATCTCCA GGCCTCTTAT ATGCCGCTTT GGATACCAGA TGGATGAAAG CAACCAATGT 3 00 
GTGGATGTGG ACGAGTGTGC AAC AGATTCC CACCAGTGCA ACCCCACCCA GATCTGCATC 360 
AATACTGAAG GCGGGT ACAC CTGCTCCTGC ACCGACGGAT ATTGGCTTCT GGAAGGCCAG 420 
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TGCTTAGACA TTGATGAATG TCGCTATGGT TACTGCCAGC AGCTCTGTGC GAATGTTCCT 4 80 



GGATCCTATT CTTGTACATG CAACCCTGGT TTTACCCTCA ATGAGGATGG AAGGTCTTGC 54 0 



CAAGATGTGA ACGAGTGTGC CACCGAGAAC CCCTGCGTGC AAACCTGCGT CAACACCTAC 600 



GGCTCTTTCA TCTGCCGCTG TGACCCAGGA TATGAACTTG AGGAAGATGG CGTTCATTGC 660 



AGTGATATGG ACGAGTGCAG CTTCTCTGAG TTCCTCTGCC AACATGAGTG TGTGAACCAG 72 0 



CCCGGCACAT ACTTCTGCTC CTGCCCTCCA GGCTACATCC TGCTGGATGA CAACCGAAGC 7 80 



TGCCAAGACA TCAACGAATG TGAGCACAGG AACCACACGT GCAACCTGCA GCAGACGTGC 84 0 



TACAATTTAC AAGGGGGCTT CAAATGCATC GACCCCATCC GCTGTGAGGA GCCTTATCTG 900 



AGGATCAGTG ATAACCGCTG TATGTGTCCT GCTGAGAACC CTGGCTGCAG AGACCAGCCC 9 60 



TTTACCATCT TGTACCGGGA CATGGACGTG GTGTCAGGAC GCTCCGTTCC CGCTGACATC 1020 



TTCCAAATGC AAGCCACGAC CCGCTACCCT GGGGCCTATT ACATTTTCCA GATCAAATCT 1080 



GGGAATGAGG GCAGAGAATT TTACATGCGG CAAACGGGCC CCATCAGTGC CACCCTGGTG 114 0 



ATGACACGCC CCATCAAAGG GCCCCGGGAA ATCCAGCTGG ACTTGGAAAT GATCACTGTC 1200 



AACACTGTCA TCAACTTCAG AGGC AGCTCC GTGATCCGAC TGCGGATATA TGTGTCGC AG 1260 
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Document Name : Abstract 
Abstract 

Constitution: Novel mouse and human polypeptide, a method for 

preparation of them, cDNAs encoding them, fragments capable to hybridizing 
them, a plasmid containing them, a host cell transformed with the plasmid, 
an antibody of the polypeptide, a pharmaceutical composition containing the 
polypeptide or antibody . 

Effects: The polypeptide of the present invention possess an inhibitory 

activity on proliferation of vascular smooth muscle cells. So it is useful 
for the treatment of diseases induced by abnormal proliferation of smooth 
muscle, for example, arteriosclerosis, myosarcoma. It is also useful for 
the treatment and/or prevention of many kinds of diseases as the polypeptide 
have hematopoiesis regulating activity, tissue generation/regeneration 
activity, Activin/Inhibin activity, c nemo tact ic /chemokine t ic activity , 
hemostatic and thrombolytic activity, receptor/ligand activity etc. 
Selected Fig. : none 
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Inhibitory activity of mouse A55 protein on proliferation 
of rat aortic vascular smooth muscle cells stimulated by PDGF 
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